Research on spectral clustering algorithms based on building different affinity matrix

Calculating and monitoring customer churn metrics is important for companies that want to retain customers and increase profit. In this study, a churn prediction framework is developed using a modified spectral clustering (SC) algorithm. The similarity measure plays a critical role in clustering, and therefore in predicting churn accurately from industrial data. The linear Euclidean distance used in traditional SC is replaced by the non-linear S-distance (Sd), which is derived from the concept of S-divergence (SD). Several properties of Sd are discussed in this work. Experiments are conducted to validate the proposed clustering algorithm on four synthetic, eight UCI, and two industrial databases, plus one telecommunications database related to customer churn. Three existing clustering algorithms (k-means, density-based spatial clustering of applications with noise, and conventional SC) are also run on the same 15 databases. The empirical results show that the proposed clustering algorithm outperforms the three existing algorithms in terms of Jaccard index, F-score, recall, precision, and accuracy. Finally, the significance of the clustering results is tested with Wilcoxon's signed-rank test, Wilcoxon's rank-sum test, and the sign test. The comparative study shows that the proposed algorithm gives promising results, especially for clusters of arbitrary shape.
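The abstract does not state the formula for the S-distance, so the following is only a sketch under an assumption. For symmetric positive definite matrices the S-divergence is log det((X+Y)/2) - (1/2) log det(XY). Restricted to strictly positive vectors (diagonal matrices) it becomes an element-wise sum, and its square root is used here as the distance. Whether this matches the paper's exact definition is an assumption, and the names s_distance, affinity_matrix, and the Gaussian bandwidth sigma are hypothetical.

```python
import math

def s_distance(x, y):
    """Assumed element-wise S-distance between two strictly positive vectors.

    Square root of sum_i [ log((x_i + y_i) / 2) - (log x_i + log y_i) / 2 ].
    Each term is non-negative by the AM-GM inequality.
    """
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    total = 0.0
    for a, b in zip(x, y):
        if a <= 0 or b <= 0:
            raise ValueError("S-distance needs strictly positive features")
        total += math.log((a + b) / 2.0) - 0.5 * (math.log(a) + math.log(b))
    # Guard against tiny negative values caused by floating-point rounding.
    return math.sqrt(max(total, 0.0))

def affinity_matrix(points, sigma=1.0):
    """Gaussian affinity built on the S-distance instead of Euclidean distance."""
    n = len(points)
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = s_distance(points[i], points[j])
            W[i][j] = W[j][i] = math.exp(-d * d / (2.0 * sigma ** 2))
    return W
```

The resulting matrix could then be passed to any standard spectral clustering routine in place of the usual Euclidean-based affinity. Features must be shifted or scaled to be strictly positive first.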

Research on Spectral Clustering

Applied Mechanics and Materials ◽ 10.4028/www.scientific.net/amm.687-691.1350 ◽ 2014 ◽ Vol 687-691 ◽ pp. 1350-1353

Author(s): Li Li Fu ◽ Yong Li Liu ◽ Li Jing Hao

Keyword(s): Spectral Clustering ◽ Clustering Algorithm ◽ Theoretical Foundation ◽ Clustering Algorithms ◽ Spectral Graph Theory ◽ Graph Partition ◽ Mining Areas ◽ Spectral Graph ◽ Definition Of ◽ Spectral Clustering Algorithm

Spectral clustering is a family of clustering algorithms based on spectral graph theory. Because it has a deep theoretical foundation and handles non-convex distributions well, it has received much attention in machine learning and data mining. The algorithm is easy to implement and often outperforms traditional clustering algorithms such as K-means. This paper aims to give some intuition about spectral clustering. We describe different graph partition criteria, the definition of spectral clustering, and the clustering steps. Finally, some improvements that address the disadvantages of spectral clustering are introduced briefly.
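The abstract lists the clustering steps without spelling them out. As a rough illustration, the textbook pipeline for the two-cluster case can be sketched with NumPy as below. This is a generic normalized spectral bipartition, not necessarily the exact variant the paper describes. The function name spectral_bipartition and the bandwidth sigma are assumptions. For more than two clusters one would normally run k-means on the first k eigenvectors instead of splitting by sign.

```python
import numpy as np

def spectral_bipartition(X, sigma=1.0):
    """Split the rows of X into two clusters with normalized spectral clustering."""
    X = np.asarray(X, dtype=float)
    # Step 1: Gaussian affinity matrix from squared Euclidean distances.
    sq_dist = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    W = np.exp(-sq_dist / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)
    # Step 2: symmetric normalized Laplacian L = I - D^{-1/2} W D^{-1/2}.
    degree = W.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(degree)
    L = np.eye(len(X)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    # Step 3: eigh returns eigenvalues in ascending order, so column 1 is
    # the eigenvector of the second-smallest eigenvalue.
    _, eigvecs = np.linalg.eigh(L)
    fiedler = d_inv_sqrt * eigvecs[:, 1]
    # Step 4: assign clusters by the sign of the rescaled eigenvector.
    return (fiedler > 0).astype(int)
```

Which cluster receives label 0 or 1 is arbitrary, because the sign of an eigenvector is not determined.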

Spectral clustering on protein-protein interaction networks via constructing affinity matrix using attributed graph embedding

Computers in Biology and Medicine ◽ 10.1016/j.compbiomed.2021.104933 ◽ 2021 ◽ pp. 104933

Author(s): Kamal Berahmand ◽ Elahe Nasiri ◽ Rojiar Pir mohammadiani ◽ Yuefeng Li

Keyword(s): Protein Interaction ◽ Spectral Clustering ◽ Graph Embedding ◽ Protein Interaction Networks ◽ Interaction Networks ◽ Affinity Matrix ◽ Protein Protein Interaction ◽ Attributed Graph ◽ Protein Protein Interaction Networks

Research and development of spectral clustering algorithms

International Journal of Collaborative Intelligence ◽ 10.1504/ijci.2016.084114 ◽ 2016 ◽ Vol 1 (4) ◽ pp. 275

Author(s): Ling Ding

Keyword(s): Research And Development ◽ Spectral Clustering ◽ Clustering Algorithms

A pareto ensemble based spectral clustering framework

Complex & Intelligent Systems ◽ 10.1007/s40747-020-00215-7 ◽ 2020

Author(s): Juanjuan Luo ◽ Huadong Ma ◽ Dongqing Zhou

Keyword(s): Phase I ◽ Phase II ◽ Spectral Clustering ◽ Clustering Algorithms ◽ Divide And Conquer ◽ Nonzero Entry ◽ Similarity Matrix ◽ Diversity Preservation ◽ Two Phases ◽ Matrix Construction

The similarity matrix has a significant effect on the performance of spectral clustering, and determining the neighborhood in the similarity matrix effectively is one of the main difficulties. In this paper, a “divide and conquer” strategy is proposed to model the similarity matrix construction task using a multiobjective evolutionary algorithm (MOEA). The procedure is divided into two phases: Phase I determines the nonzero entries of the similarity matrix, and Phase II determines the values of those nonzero entries. In Phase I, the main contribution is to model the task as a biobjective dynamic optimization problem that optimizes diversity and similarity at the same time. Each individual determines one nonzero entry for each sample, so the encoding length decreases to O(N), in contrast with non-ensemble multiobjective spectral clustering. In addition, a specific initialization operator and a diversity preservation strategy are proposed for this phase. In Phase II, three ensemble strategies are designed to determine the values of the nonzero entries of the similarity matrix. Furthermore, this Pareto ensemble framework is extended to semi-supervised clustering by transforming the semi-supervised information into constraints. In contrast with previous multiobjective evolutionary spectral clustering algorithms, the proposed Pareto ensemble-based framework balances time cost and clustering accuracy, as demonstrated in the experiments section.
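To make the two decisions concrete (which entries of the similarity matrix are nonzero, and what values they take), here is a deliberately simple baseline sketch. It is not the paper's MOEA or its ensemble strategies. It substitutes a k-nearest-neighbor rule for Phase I and a Gaussian kernel for Phase II, purely for illustration. The names knn_mask and fill_values and the parameters k and sigma are hypothetical.

```python
import math

def knn_mask(points, k):
    """Phase I stand-in: choose which similarity entries are nonzero.

    Each sample keeps its k nearest neighbors; the mask is symmetrized so
    an entry is kept if either endpoint selected the other.
    """
    n = len(points)
    mask = [[False] * n for _ in range(n)]
    for i in range(n):
        others = [j for j in range(n) if j != i]
        others.sort(key=lambda j: math.dist(points[i], points[j]))
        for j in others[:k]:
            mask[i][j] = mask[j][i] = True
    return mask

def fill_values(points, mask, sigma=1.0):
    """Phase II stand-in: give each selected entry a Gaussian similarity."""
    n = len(points)
    S = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if mask[i][j]:
                d = math.dist(points[i], points[j])
                S[i][j] = math.exp(-d * d / (2.0 * sigma ** 2))
    return S
```

With one selected neighbor per sample (k=1), the mask needs only one index per sample, which is in the spirit of the O(N) encoding length mentioned in the abstract.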

Ship-handling behavior pattern recognition using AIS sub-trajectory clustering analysis based on the T-SNE and spectral clustering algorithms

Ocean Engineering ◽ 10.1016/j.oceaneng.2020.106919 ◽ 2020 ◽ Vol 205 ◽ pp. 106919 ◽ Cited By ~ 3

Author(s): Miao Gao ◽ Guo-You Shi

Keyword(s): Pattern Recognition ◽ Clustering Analysis ◽ Spectral Clustering ◽ Behavior Pattern ◽ Clustering Algorithms ◽ Trajectory Clustering ◽ Ship Handling

An Improved Spectral Clustering Community Detection Algorithm Based on Probability Matrix

Discrete Dynamics in Nature and Society ◽ 10.1155/2020/4540302 ◽ 2020 ◽ Vol 2020 ◽ pp. 1-6

Author(s): Shuxia Ren ◽ Shubo Zhang ◽ Tao Wu

Keyword(s): Community Detection ◽ Spectral Clustering ◽ Clustering Algorithm ◽ Transition Probability ◽ Clustering Algorithms ◽ Detection Algorithm ◽ Community Information ◽ The Mean ◽ Community Detection Algorithm ◽ Spectral Clustering Algorithm

The similarity graphs of most spectral clustering algorithms carry a lot of incorrect community information. In this paper, we propose a probability matrix and an improved spectral clustering algorithm based on it for community detection. First, a Markov chain is used to calculate the transition probabilities between nodes, and the probability matrix is constructed from these transition probabilities. Then, the similarity graph is constructed from the mean probability matrix. Finally, community detection is achieved by optimizing the NCut objective function. The proposed algorithm is compared with SC, WT, FG, FluidC, and SCRW on artificial and real networks. Experimental results show that the proposed algorithm detects communities more accurately and has better clustering performance.
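Two ingredients named in the abstract have standard definitions that can be sketched directly: the one-step Markov transition matrix P = D^{-1}A of a graph, and the NCut value of a two-way partition, cut(S,T)/vol(S) + cut(S,T)/vol(T). The abstract does not say how the "mean probability matrix" is formed, so that step is left out here. The function names transition_matrix and ncut are hypothetical.

```python
def transition_matrix(A):
    """Row-normalize an adjacency matrix: P[i][j] = A[i][j] / degree(i)."""
    P = []
    for row in A:
        degree = sum(row)
        if degree == 0:
            raise ValueError("isolated node: transition probabilities undefined")
        P.append([w / degree for w in row])
    return P

def ncut(A, part):
    """Normalized cut of the partition (part, complement of part).

    Assumes a symmetric adjacency matrix A and a non-empty proper subset
    `part` of node indices, each side containing at least one edge endpoint.
    """
    n = len(A)
    S = set(part)
    T = set(range(n)) - S
    cut = sum(A[i][j] for i in S for j in T)   # weight crossing the partition
    vol_S = sum(sum(A[i]) for i in S)          # total degree inside S
    vol_T = sum(sum(A[i]) for i in T)          # total degree inside T
    return cut / vol_S + cut / vol_T

# Two triangles (nodes 0-2 and 3-5) joined by the single edge 2-3.
A = [
    [0, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 1, 0, 1, 0, 0],
    [0, 0, 1, 0, 1, 1],
    [0, 0, 0, 1, 0, 1],
    [0, 0, 0, 1, 1, 0],
]
good = ncut(A, {0, 1, 2})   # splits along the bridge edge
bad = ncut(A, {0})          # isolates a single node
```

On this toy graph the split along the bridge edge has a lower NCut value than isolating one node, which is the behavior the objective is designed to reward.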

Lifelong Spectral Clustering

Proceedings of the AAAI Conference on Artificial Intelligence ◽ 10.1609/aaai.v34i04.6045 ◽ 2020 ◽ Vol 34 (04) ◽ pp. 5867-5874

Author(s): Gan Sun ◽ Yang Cong ◽ Qianqian Wang ◽ Jun Li ◽ Yun Fu

Keyword(s): Machine Learning ◽ Real World ◽ Spectral Clustering ◽ State Of The Art ◽ Clustering Algorithms ◽ Orthogonal Basis ◽ Learning Framework ◽ The Past ◽ Benchmark Datasets ◽ Over Time

Over the past decades, spectral clustering (SC) has become one of the most effective clustering algorithms. However, most previous studies focus on spectral clustering with a fixed task set, which cannot incorporate a new spectral clustering task without access to previously learned tasks. In this paper, we explore the problem of spectral clustering in a lifelong machine learning framework, i.e., Lifelong Spectral Clustering (L2SC). Its goal is to efficiently learn a model for a new spectral clustering task by selectively transferring previously accumulated experience from a knowledge library. Specifically, the knowledge library of L2SC contains two components: 1) an orthogonal basis library, which captures latent cluster centers among the clusters in each pair of tasks; and 2) a feature embedding library, which embeds the feature manifold information shared among multiple related tasks. When a new spectral clustering task arrives, L2SC first transfers knowledge from both the basis library and the feature library to obtain an encoding matrix, and then redefines the library bases over time to maximize performance across all the clustering tasks. Meanwhile, a general online update formulation is derived to alternately update the basis library and the feature library. Finally, experiments on several real-world benchmark datasets demonstrate that the L2SC model effectively improves clustering performance compared with other state-of-the-art spectral clustering algorithms.
