ALLEVIATING THE SPARSITY PROBLEM OF COLLABORATIVE FILTERING USING AN EFFICIENT ITERATIVE CLUSTERED PREDICTION TECHNIQUE

Collaborative filtering (CF) is one of the most prevalent recommendation techniques, providing personalized recommendations to users based on their previously expressed preferences and those of other similar users. Although CF has been widely applied in various applications, its applicability is restricted due to the data sparsity, the data inadequateness of new users and new items (cold start problem), and the growth of both the number of users and items in the database (scalability problem). In this paper, we propose an efficient iterative clustered prediction technique to transform user-item sparse matrix to a dense one and overcome the scalability problem. In this technique, spectral clustering algorithm is utilized to optimize the neighborhood selection and group the data into users' and items' clusters. Then, both clustered user-based and clustered item-based approaches are aggregated to efficiently predict the unknown ratings. Our experiments on MovieLens and book-crossing data sets indicate substantial and consistent improvements in recommendations accuracy compared to the hybrid user-based and item-based approach without clustering, hybrid approach with k-means and singular value decomposition (SVD)-based CF. Furthermore, we demonstrated the effectiveness of the proposed iterative technique and proved its performance through a varying number of iterations.

Download Full-text

Privacy-preserving constrained spectral clustering algorithm for large-scale data sets

IET Information Security ◽

10.1049/iet-ifs.2019.0255 ◽

2020 ◽

Vol 14 (3) ◽

pp. 321-331 ◽

Cited By ~ 1

Author(s):

Ji Li ◽

Jianghong Wei ◽

Mao Ye ◽

Wenfen Liu ◽

Xuexian Hu

Keyword(s):

Spectral Clustering ◽

Large Scale ◽

Clustering Algorithm ◽

Privacy Preserving ◽

Data Sets ◽

Large Scale Data ◽

Spectral Clustering Algorithm ◽

Scale Data ◽

Large Scale Data Sets

Download Full-text

Spectral Clustering Algorithm Based on Improved Gaussian Kernel Function and Beetle Antennae Search with Damping Factor

Computational Intelligence and Neuroscience ◽

10.1155/2020/1648573 ◽

2020 ◽

Vol 2020 ◽

pp. 1-9

Author(s):

Zhe Zhang ◽

Xiyu Liu ◽

Lin Wang

Keyword(s):

Kernel Function ◽

Spectral Clustering ◽

Clustering Algorithm ◽

Gaussian Kernel ◽

Damping Factor ◽

Data Sets ◽

Similarity Matrix ◽

Scale Parameters ◽

Gaussian Kernel Function ◽

Spectral Clustering Algorithm

There are two problems in the traditional spectral clustering algorithm. Firstly, when it uses Gaussian kernel function to construct the similarity matrix, different scale parameters in Gaussian kernel function will lead to different results of the algorithm. Secondly, K-means algorithm is often used in the clustering stage of the spectral clustering algorithm. It needs to initialize the cluster center randomly, which will result in the instability of the results. In this paper, an improved spectral clustering algorithm is proposed to solve these two problems. In constructing a similarity matrix, we proposed an improved Gaussian kernel function, which is based on the distance information of some nearest neighbors and can adaptively select scale parameters. In the clustering stage, beetle antennae search algorithm with damping factor is proposed to complete the clustering to overcome the problem of instability of the clustering results. In the experiment, we use four artificial data sets and seven UCI data sets to verify the performance of our algorithm. In addition, four images in BSDS500 image data sets are segmented in this paper, and the results show that our algorithm is better than other comparison algorithms in image segmentation.

Download Full-text

Efficient parallel spectral clustering algorithm design for large data sets under cloud computing environment

Journal of Cloud Computing Advances Systems and Applications ◽

10.1186/2192-113x-2-18 ◽

2013 ◽

Vol 2 (1) ◽

pp. 18 ◽

Cited By ~ 9

Author(s):

Ran Jin ◽

Chunhai Kou ◽

Ruijuan Liu ◽

Yefeng Li

Keyword(s):

Cloud Computing ◽

Spectral Clustering ◽

Clustering Algorithm ◽

Algorithm Design ◽

Large Data ◽

Large Data Sets ◽

Data Sets ◽

Computing Environment ◽

Cloud Computing Environment ◽

Spectral Clustering Algorithm

Download Full-text

Rating-Based Collaborative Filtering Using Spectral Clustering Algorithm

Journal of Physics Conference Series ◽

10.1088/1742-6596/1549/3/032022 ◽

2020 ◽

Vol 1549 ◽

pp. 032022

Author(s):

Yongjie Yan ◽

Hui Xie ◽

Li Ma

Keyword(s):

Collaborative Filtering ◽

Spectral Clustering ◽

Clustering Algorithm ◽

Spectral Clustering Algorithm

Download Full-text

An improved spectral clustering algorithm based on local neighbors in kernel space

Computer Science and Information Systems ◽

10.2298/csis110415064l ◽

2011 ◽

Vol 8 (4) ◽

pp. 1143-1157 ◽

Cited By ~ 5

Author(s):

Xinyue Liu ◽

Xing Yong ◽

Hongfei Lin

Keyword(s):

Real World ◽

Spectral Clustering ◽

Clustering Algorithm ◽

Sparse Matrix ◽

Feature Space ◽

Data Sets ◽

Kernel Space ◽

Real World Data ◽

World Data ◽

Linear Reconstruction

Similarity matrix is critical to the performance of spectral clustering. Mercer kernels have become popular largely due to its successes in applying kernel methods such as kernel PCA. A novel spectral clustering method is proposed based on local neighborhood in kernel space (SC-LNK), which assumes that each data point can be linearly reconstructed from its neighbors. The SC-LNK algorithm tries to project the data to a feature space by the Mercer kernel, and then learn a sparse matrix using linear reconstruction as the similarity graph for spectral clustering. Experiments have been performed on synthetic and real world data sets and have shown that spectral clustering based on linear reconstruction in kernel space outperforms the conventional spectral clustering and the other two algorithms, especially in real world data sets.

Download Full-text

Evaluation of Two-Step Spectral Clustering Algorithm for Large Untypical Data Sets

Data Analysis and Classification - Studies in Classification, Data Analysis, and Knowledge Organization ◽

10.1007/978-3-030-75190-6_1 ◽

2021 ◽

pp. 3-9

Author(s):

Andrzej Dudek

Keyword(s):

Spectral Clustering ◽

Clustering Algorithm ◽

Data Sets ◽

Spectral Clustering Algorithm

Download Full-text

Recommendation system using the k-nearest neighbors and singular value decomposition algorithms

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v11i6.pp5541-5548 ◽

2021 ◽

Vol 11 (6) ◽

pp. 5541

Author(s):

Badr Hssina ◽

Abdelkader Grota ◽

Mohammed Erritali

Keyword(s):

Singular Value Decomposition ◽

Collaborative Filtering ◽

Recommendation System ◽

Hybrid Approach ◽

Singular Value ◽

User Preferences ◽

Decomposition Algorithms ◽

K Nearest Neighbors ◽

The Matrix ◽

Value Decomposition

<span>Nowadays, recommendation systems are used successfully to provide items (example: movies, music, books, news, images) tailored to user preferences. Amongst the approaches existing to recommend adequate content, we use the collaborative filtering approach of finding the information that satisfies the user by using the reviews of other users. These reviews are stored in matrices that their sizes increase exponentially to predict whether an item is relevant or not. The evaluation shows that these systems provide unsatisfactory recommendations because of what we call the cold start factor. Our objective is to apply a hybrid approach to improve the quality of our recommendation system. The benefit of this approach is the fact that it does not require a new algorithm for calculating the predictions. We are going to apply two algorithms: k-nearest neighbours (KNN) and the matrix factorization algorithm of collaborative filtering which are based on the method of (singular-value-decomposition). Our combined model has a very high precision and the experiments show that our method can achieve better results.</span>

Download Full-text

The Larger the Better: Analysis of a Scalable Spectral Clustering Algorithm with Cosine Similarity

10.3233/faia210280 ◽

2021 ◽

Author(s):

Guangliang Chen

Keyword(s):

Perturbation Analysis ◽

Spectral Clustering ◽

Clustering Algorithm ◽

Linear Complexity ◽

Large Data ◽

Cosine Similarity ◽

Large Data Sets ◽

Data Sets ◽

Scalable Algorithm ◽

Spectral Clustering Algorithm

Chen (2018) proposed a scalable spectral clustering algorithm for cosine similarity to handle the task of clustering large data sets. It runs extremely fast, with a linear complexity in the size of the data, and achieves state of the art accuracy. This paper conducts perturbation analysis of the algorithm to understand the effect of discarding a perturbation term in an eigendecomposition step. Our results show that the accuracy of the approximation by the scalable algorithm depends on the connectivity of the clusters, their separation and sizes, and is especially accurate for large data sets.

Download Full-text

An Enhanced Spectral Clustering Algorithm with S-Distance

Symmetry ◽

10.3390/sym13040596 ◽

2021 ◽

Vol 13 (4) ◽

pp. 596

Author(s):

Krishna Kumar Sharma ◽

Ayan Seal ◽

Enrique Herrera-Viedma ◽

Ondrej Krejcar

Keyword(s):

Spectral Clustering ◽

Clustering Algorithm ◽

Spatial Clustering ◽

Clustering Algorithms ◽

Rank Test ◽

Customer Churn ◽

Signed Rank ◽

Signed Rank Test ◽

Spectral Clustering Algorithm ◽

Industrial Databases

Calculating and monitoring customer churn metrics is important for companies to retain customers and earn more profit in business. In this study, a churn prediction framework is developed by modified spectral clustering (SC). However, the similarity measure plays an imperative role in clustering for predicting churn with better accuracy by analyzing industrial data. The linear Euclidean distance in the traditional SC is replaced by the non-linear S-distance (Sd). The Sd is deduced from the concept of S-divergence (SD). Several characteristics of Sd are discussed in this work. Assays are conducted to endorse the proposed clustering algorithm on four synthetics, eight UCI, two industrial databases and one telecommunications database related to customer churn. Three existing clustering algorithms—k-means, density-based spatial clustering of applications with noise and conventional SC—are also implemented on the above-mentioned 15 databases. The empirical outcomes show that the proposed clustering algorithm beats three existing clustering algorithms in terms of its Jaccard index, f-score, recall, precision and accuracy. Finally, we also test the significance of the clustering results by the Wilcoxon’s signed-rank test, Wilcoxon’s rank-sum test, and sign tests. The relative study shows that the outcomes of the proposed algorithm are interesting, especially in the case of clusters of arbitrary shape.

Download Full-text

Incorporating Singular Value Decomposition in User-based Collaborative Filtering Technique for a Movie Recommendation System

Proceedings of the 2019 the International Conference on Pattern Recognition and Artificial Intelligence - PRAI '19 ◽

10.1145/3357777.3357782 ◽

2019 ◽

Cited By ~ 1

Author(s):

Vito Xituo Chen ◽

Tiffany Y. Tang

Keyword(s):

Singular Value Decomposition ◽

Collaborative Filtering ◽

Recommendation System ◽

Singular Value ◽

Filtering Technique ◽

Movie Recommendation ◽

Value Decomposition

Download Full-text