Graph Clustering via Variational Graph Embedding

2022 ◽  
Vol 122 ◽  
pp. 108334
Author(s):  
Lin Guo ◽  
Qun Dai
Author(s):  
Chun Wang ◽  
Shirui Pan ◽  
Ruiqi Hu ◽  
Guodong Long ◽  
Jing Jiang ◽  
...  

Graph clustering is a fundamental task that discovers communities or groups in networks. Recent studies have mostly focused on developing deep learning approaches to learn a compact graph embedding, upon which classic clustering methods like k-means or spectral clustering algorithms are applied. These two-step frameworks are difficult to tune and usually lead to suboptimal performance, mainly because the graph embedding is not goal-directed, i.e., designed for the specific clustering task. In this paper, we propose a goal-directed deep learning approach, Deep Attentional Embedded Graph Clustering (DAEGC for short). Our method focuses on attributed graphs so as to fully exploit both sides of the information in graphs. By employing an attention network to capture the importance of the neighboring nodes to a target node, our DAEGC algorithm encodes the topological structure and node content of a graph into a compact representation, on which an inner product decoder is trained to reconstruct the graph structure. Furthermore, soft labels generated from the graph embedding itself supervise a self-training graph clustering process, which iteratively refines the clustering results. The self-training process is jointly learned and optimized with the graph embedding in a unified framework, so that the two components mutually benefit each other. Experimental comparisons with state-of-the-art algorithms demonstrate the superiority of our method.
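The self-training step described in the abstract can be sketched in a few lines. This is a minimal NumPy illustration, assuming the Student's t soft-assignment and the squared-frequency target distribution commonly used in deep embedded clustering; the function names and shapes here are illustrative, not taken from the paper:

```python
import numpy as np

def soft_assign(z, mu, alpha=1.0):
    """Soft cluster assignment q_ij of embeddings z (n, d) to centers mu (k, d)
    using a Student's t kernel; rows of the result sum to 1."""
    d2 = ((z[:, None, :] - mu[None, :, :]) ** 2).sum(-1)      # squared distances (n, k)
    q = (1.0 + d2 / alpha) ** (-(alpha + 1.0) / 2.0)
    return q / q.sum(axis=1, keepdims=True)

def target_dist(q):
    """Sharpened target p_ij proportional to q_ij^2 / f_j, where f_j = sum_i q_ij.
    Training minimizes KL(p || q), pulling embeddings toward confident clusters."""
    w = q ** 2 / q.sum(axis=0)
    return w / w.sum(axis=1, keepdims=True)
```

In the self-training loop, `q` is recomputed from the current embeddings, `p` is derived from `q`, and the KL divergence between them is added to the reconstruction loss of the inner product decoder.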


Author(s):  
Zitai Chen ◽  
Chuan Chen ◽  
Zong Zhang ◽  
Zibin Zheng ◽  
Qingsong Zou

As a fundamental machine learning problem, graph clustering has facilitated various real-world applications, and tremendous effort has been devoted to it over the past few decades. However, most existing methods, like spectral clustering, struggle with sparsity, scalability, robustness, and high-dimensional raw information. To address these issues, we propose a deep probabilistic model, called Variational Graph Embedding and Clustering with Laplacian Eigenmaps (VGECLE), which learns node embeddings and assigns node clusters simultaneously. It represents each node as a Gaussian distribution to disentangle the true embedding position from the uncertainty in the graph. With a Mixture of Gaussians (MoG) prior, VGECLE is capable of learning an interpretable clustering through variational inference and a generative process. To better learn pairwise relationships, we propose a Teacher-Student mechanism that encourages each node to learn a better Gaussian from its immediate neighbors under stochastic gradient descent (SGD) training. By optimizing the graph embedding and the graph clustering problem as a whole, our model can fully exploit the correlation between them. To the best of our knowledge, we are the first to tackle graph clustering from a deep probabilistic viewpoint. We perform extensive experiments on both synthetic and real-world networks to corroborate the effectiveness and efficiency of the proposed framework.
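The two central ingredients of the abstract above, a per-node Gaussian embedding sampled via the reparameterization trick and cluster assignment under a MoG prior, can be sketched as follows. This is a hedged NumPy illustration of the general technique, not the paper's implementation; the isotropic variance and uniform mixture weights are simplifying assumptions of ours:

```python
import numpy as np

def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, I), so gradients can flow
    through mu and log_var during SGD training."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def mog_responsibilities(z, pi, means, var):
    """Posterior cluster probabilities p(c | z) for samples z (n, d) under an
    isotropic Mixture of Gaussians prior with weights pi (k,) and means (k, d)."""
    d = z.shape[1]
    d2 = ((z[:, None, :] - means[None, :, :]) ** 2).sum(-1)   # (n, k)
    log_p = np.log(pi)[None, :] - d2 / (2.0 * var) - 0.5 * d * np.log(2.0 * np.pi * var)
    log_p -= log_p.max(axis=1, keepdims=True)                 # stabilize before exp
    p = np.exp(log_p)
    return p / p.sum(axis=1, keepdims=True)
```

The responsibilities give the soft cluster assignment for each node; in a full variational model they would also enter the KL term against the MoG prior.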


2014 ◽  
Vol 36 (8) ◽  
pp. 1704-1713 ◽  
Author(s):  
Ye WU ◽  
Zhi-Nong ZHONG ◽  
Wei XIONG ◽  
Luo CHEN ◽  
Ning JING

Author(s):  
A-Yeong Kim ◽  
Hee-Guen Yoon ◽  
Seong-Bae Park ◽  
Se-Young Park ◽  
...  

Author(s):  
Yun Peng ◽  
Byron Choi ◽  
Jianliang Xu

Graphs have been widely used to represent complex data in many applications, such as e-commerce, social networks, and bioinformatics. Efficient and effective analysis of graph data is important for graph-based applications. However, most graph analysis tasks are combinatorial optimization (CO) problems, which are NP-hard. Recent studies have focused heavily on the potential of using machine learning (ML) to solve graph-based CO problems. Most recent methods follow a two-stage framework. The first stage is graph representation learning, which embeds the graphs into low-dimensional vectors. The second stage uses machine learning to solve the CO problems using the embeddings of the graphs learned in the first stage. The works for the first stage can be classified into two categories: graph embedding methods and end-to-end learning methods. For graph embedding methods, the learning of the embeddings of the graphs has its own objective, which may not rely on the CO problems to be solved; the CO problems are solved by independent downstream tasks. For end-to-end learning methods, the learning of the embeddings of the graphs does not have its own objective and is an intermediate step of the learning procedure of solving the CO problems. The works for the second stage can also be classified into two categories: non-autoregressive methods and autoregressive methods. Non-autoregressive methods predict a solution for a CO problem in one shot: a non-autoregressive method predicts a matrix that denotes the probability of each node/edge being part of a solution of the CO problem, and the solution can be computed from the matrix using search heuristics such as beam search. Autoregressive methods iteratively extend a partial solution step by step: at each step, an autoregressive method predicts a node/edge conditioned on the current partial solution, which is then used to extend it.
In this survey, we provide a thorough overview of recent studies of graph learning-based CO methods. The survey ends with several remarks on future research directions.
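The autoregressive decoding scheme described in the survey abstract can be sketched concretely. The NumPy snippet below greedily decodes a tour from a node-to-node probability matrix, extending the partial solution one node at a time while masking nodes already chosen; the function name and greedy (beam width 1) strategy are our illustrative assumptions, not from the survey:

```python
import numpy as np

def greedy_decode_tour(prob, start=0):
    """Autoregressive-style decoding: given an (n, n) matrix where prob[i, j]
    scores moving from node i to node j, repeatedly extend the partial tour
    with the highest-scoring unvisited successor of the last node."""
    n = prob.shape[0]
    tour = [start]
    visited = {start}
    while len(tour) < n:
        scores = prob[tour[-1]].astype(float).copy()
        scores[list(visited)] = -np.inf   # mask nodes already in the partial solution
        nxt = int(np.argmax(scores))
        tour.append(nxt)
        visited.add(nxt)
    return tour
```

Replacing the single `argmax` with the top-k candidates at each step turns this greedy decoder into the beam search mentioned in the abstract; a non-autoregressive method would instead predict the whole matrix once and search over it without re-conditioning on the partial solution.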

