Improving Matrix Factorization-Based Recommender via Ensemble Methods

One of the most popular approaches to Collaborative Filtering is based on Matrix Factorization (MF). In this paper, we focus on improving the accuracy of MF-based recommenders through homogeneous ensemble methods. To build such ensembles, we investigate a series of methods in two main categories: (i) manipulating the training examples, including bagging, AdaBoost, and Forward Stepwise Additive Regression; (ii) injecting randomness into the base models' training settings, including randomizing the initial parameters and randomizing the training sequences. Each method is evaluated on two large, real datasets, and the effective methods are then combined to form a cascade MF ensemble scheme. The validation results on the experimental datasets demonstrate that, compared to a single MF-based recommender, our ensemble scheme obtains a significant improvement in prediction accuracy.
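As a rough illustration of two of the ideas named in the abstract above (bagging, plus randomized initialization and training order), the following is a minimal sketch, not the authors' implementation. It assumes ratings are given as (user, item, rating) triples, uses plain SGD for each base model, and averages the base predictions. All function names and hyperparameter values are hypothetical. AdaBoost, Forward Stepwise Additive Regression, and the cascade scheme are not shown.

```python
import random

def train_mf(ratings, n_users, n_items, k=4, lr=0.02, reg=0.05, epochs=50, seed=0):
    """Train one MF base model by SGD on (user, item, rating) triples."""
    rng = random.Random(seed)
    # Randomized initialization: a different seed yields a different base model.
    P = [[rng.gauss(0, 0.1) for _ in range(k)] for _ in range(n_users)]
    Q = [[rng.gauss(0, 0.1) for _ in range(k)] for _ in range(n_items)]
    data = list(ratings)
    for _ in range(epochs):
        rng.shuffle(data)  # randomized training sequence
        for u, i, r in data:
            err = r - sum(pu * qi for pu, qi in zip(P[u], Q[i]))
            for f in range(k):
                pu, qi = P[u][f], Q[i][f]
                P[u][f] += lr * (err * qi - reg * pu)
                Q[i][f] += lr * (err * pu - reg * qi)
    return P, Q

def predict(model, u, i):
    """Predicted rating: dot product of the user and item factor vectors."""
    P, Q = model
    return sum(pu * qi for pu, qi in zip(P[u], Q[i]))

def bagged_ensemble(ratings, n_users, n_items, n_models=5, seed=0):
    """Bagging: each base model is trained on a bootstrap resample of the ratings."""
    rng = random.Random(seed)
    models = []
    for m in range(n_models):
        sample = [rng.choice(ratings) for _ in ratings]  # sample with replacement
        models.append(train_mf(sample, n_users, n_items, seed=seed + m + 1))
    return models

def ensemble_predict(models, u, i):
    """Homogeneous ensemble prediction: unweighted mean of the base predictions."""
    return sum(predict(m, u, i) for m in models) / len(models)
```

Averaging is the simplest combiner; it is chosen here only to keep the sketch short.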


Modeling Implicit Trust in Matrix Factorization-Based Collaborative Filtering

Applied Sciences ◽

10.3390/app9204378 ◽

2019 ◽

Vol 9 (20) ◽

pp. 4378 ◽

Cited By ~ 2

Author(s):

Yuan ◽

Zahir ◽

Yang

Keyword(s):

Collaborative Filtering ◽

Matrix Factorization ◽

Prediction Accuracy ◽

State Of The Art ◽

Side Information ◽

Initial Trust ◽

The Social ◽

Implicit Trust ◽

Value Decomposition ◽

Better Than

Recommendation systems often use side information both to alleviate problems such as cold start and data sparsity and to increase prediction accuracy. One such piece of side information, which has been widely investigated in addressing these challenges, is trust. However, the difficulty of obtaining explicit relationship data has led researchers to infer trust values by other means, such as the user-to-item relationship. This paper proposes a model to improve prediction accuracy by applying the trust relationship between the user and item ratings. Two approaches to incorporating trust into prediction are proposed: one uses the estimated trust, and the other uses the initial trust. The efficiency of the proposed method is verified by comparing the obtained results with four well-known methods, including the state-of-the-art deep learning-based method of neural graph collaborative filtering (NGCF). The experimental results demonstrate that the proposed method performs significantly better than NGCF and three other matrix factorization methods, namely singular value decomposition (SVD), SVD++, and social matrix factorization (SocialMF).


Optimizing Latent Factors and Collaborative Filtering for Students’ Performance Prediction

Applied Sciences ◽

10.3390/app10165601 ◽

2020 ◽

Vol 10 (16) ◽

pp. 5601

Author(s):

Juan A. Gómez-Pulido ◽

Arturo Durán-Domínguez ◽

Francisco Pajuelo-Holguera

Keyword(s):

Collaborative Filtering ◽

Matrix Factorization ◽

Prediction Accuracy ◽

Search Space ◽

Learning Rate ◽

Latent Factors ◽

Gradient Descent Algorithm ◽

Real World Datasets ◽

The Right ◽

Key Aspects

The problem of predicting students’ performance has recently been tackled using matrix factorization, a popular method for recommender systems based on collaborative filtering. This problem consists of predicting the unknown performance or score of a particular student for a task s/he did not complete or did not attend, according to the scores of the tasks s/he did complete and the scores of the colleagues who completed the task in question. The solving method combines matrix factorization with a gradient descent algorithm to build a prediction model that minimizes the error in the prediction of test data. However, we identified two key aspects that influence the accuracy of the prediction. On the one hand, the model involves a pair of important parameters, the learning rate and the regularization factor, for which no fixed values suit every experimental case. On the other hand, the datasets are extracted from virtual classrooms on online campuses and have a number of implicit latent factors. The right number is difficult to ascertain, as it depends on the nature of the dataset: subject, size, type of learning, academic environment, etc. This paper proposes some approaches to improve the prediction accuracy by optimizing the number of latent factors, the learning rate, and the regularization factor. To this end, we apply optimization algorithms that cover a wide search space. The experimental results obtained from real-world datasets improved the prediction accuracy in the context of a thorough search for predefined values. Obtaining optimized values of these parameters allows us to apply them to further predictions for similar datasets.
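To make the tuning problem described above concrete, here is a minimal sketch under stated assumptions. It fits a matrix factorization model by stochastic gradient descent on (student, task, score) triples and picks the number of latent factors, the learning rate, and the regularization factor by validation RMSE. The abstract does not name its optimization algorithms, so a plain grid search is used here as a stand-in. All names and values are hypothetical.

```python
import itertools
import math
import random

def fit_mf(train, n_students, n_tasks, k, lr, reg, epochs=100, seed=0):
    """Fit student factors P and task factors Q by SGD with L2 regularization."""
    rng = random.Random(seed)
    P = [[rng.uniform(0, 0.5) for _ in range(k)] for _ in range(n_students)]
    Q = [[rng.uniform(0, 0.5) for _ in range(k)] for _ in range(n_tasks)]
    data = list(train)
    for _ in range(epochs):
        rng.shuffle(data)
        for s, t, y in data:
            err = y - sum(p * q for p, q in zip(P[s], Q[t]))
            for f in range(k):
                p, q = P[s][f], Q[t][f]
                P[s][f] += lr * (err * q - reg * p)
                Q[t][f] += lr * (err * p - reg * q)
    return P, Q

def rmse(model, data):
    """Root-mean-square error of the model's predictions on held-out triples."""
    P, Q = model
    total = 0.0
    for s, t, y in data:
        e = y - sum(p * q for p, q in zip(P[s], Q[t]))
        total += e * e
    return math.sqrt(total / len(data))

def grid_search(train, valid, n_students, n_tasks, grid):
    """Return (best_rmse, best_params), or None if every setting diverged."""
    best = None
    for k, lr, reg in itertools.product(grid["k"], grid["lr"], grid["reg"]):
        score = rmse(fit_mf(train, n_students, n_tasks, k, lr, reg), valid)
        if not math.isfinite(score):
            continue  # skip settings where SGD diverged
        if best is None or score < best[0]:
            best = (score, {"k": k, "lr": lr, "reg": reg})
    return best
```

A call such as `grid_search(train, valid, 4, 3, {"k": [1, 2], "lr": [0.01, 0.05], "reg": [0.0, 0.1]})` tries every combination. The cost grows multiplicatively with each added value, which is one reason a smarter search over a wide space can be attractive.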


MSGD: A Novel Matrix Factorization Approach for Large-Scale Collaborative Filtering Recommender Systems on GPUs

IEEE Transactions on Parallel and Distributed Systems ◽

10.1109/tpds.2017.2718515 ◽

2018 ◽

Vol 29 (7) ◽

pp. 1530-1544 ◽

Cited By ~ 32

Author(s):

Hao Li ◽

Kenli Li ◽

Jiyao An ◽

Keqin Li

Keyword(s):

Collaborative Filtering ◽

Recommender Systems ◽

Matrix Factorization ◽

Large Scale ◽

Factorization Approach


Application and Research of Improved Probability Matrix Factorization Techniques in Collaborative Filtering

International Journal of Control and Automation ◽

10.14257/ijca.2014.7.8.08 ◽

2014 ◽

Vol 7 (8) ◽

pp. 79-92 ◽

Cited By ~ 7

Author(s):

Zhijun Zhang ◽

Hong Liu

Keyword(s):

Collaborative Filtering ◽

Matrix Factorization


Investigating Overparameterization for Non-Negative Matrix Factorization in Collaborative Filtering

10.1145/3460231.3478854 ◽

2021 ◽

Author(s):

Yuhi Kawakami ◽

Mahito Sugiyama

Keyword(s):

Collaborative Filtering ◽

Matrix Factorization ◽

Non Negative Matrix Factorization


Quantile Matrix Factorization for Collaborative Filtering

Lecture Notes in Business Information Processing - E-Commerce and Web Technologies ◽

10.1007/978-3-642-15208-5_23 ◽

2010 ◽

pp. 253-264 ◽

Cited By ~ 2

Author(s):

Alexandros Karatzoglou ◽

Markus Weimer

Keyword(s):

Collaborative Filtering ◽

Matrix Factorization


Kernelized Matrix Factorization for Collaborative Filtering

Proceedings of the 2016 SIAM International Conference on Data Mining ◽

10.1137/1.9781611974348.43 ◽

2016 ◽

Cited By ~ 20

Author(s):

Xinyue Liu ◽

Charu Aggarwal ◽

Yu-Feng Li ◽

Xiangnan Kong ◽

Xinyuan Sun ◽

...

Keyword(s):

Collaborative Filtering ◽

Matrix Factorization


Collaborative Filtering Recommendation Using Nonnegative Matrix Factorization in GPU-Accelerated Spark Platform

Scientific Programming ◽

10.1155/2021/8841133 ◽

2021 ◽

Vol 2021 ◽

pp. 1-15

Author(s):

Bing Tang ◽

Linyao Kang ◽

Li Zhang ◽

Feiyan Guo ◽

Haiwu He

Keyword(s):

Collaborative Filtering ◽

Processing Speed ◽

Matrix Factorization ◽

Nonnegative Matrix Factorization ◽

Nonnegative Matrix ◽

Experimental Results ◽

Computational Time ◽

Data Sets ◽

Heterogeneous Cluster ◽

The Matrix

Nonnegative matrix factorization (NMF) has been introduced as an efficient way to reduce the complexity of data compression and is valued for its capability of extracting highly interpretable parts from data sets. It has been applied to various fields, such as recommendations, image analysis, and text clustering. However, as the size of the matrix increases, NMF becomes very slow. To solve this problem, this paper proposes a GPU-based parallel algorithm for NMF on the Spark platform, which makes full use of the advantages of in-memory computation and GPU acceleration. The new GPU-accelerated NMF on Spark is evaluated on a 4-node heterogeneous Spark cluster on Google Compute Engine, with each node configured with an NVIDIA K80 CUDA device, and experimental results indicate that it is competitive in computational time with existing solutions across a variety of matrix orders. Furthermore, a GPU-accelerated NMF-based parallel collaborative filtering (CF) algorithm is also proposed, utilizing the data dimensionality reduction and feature extraction of NMF as well as the multicore parallel computing mode of CUDA. Using real MovieLens data sets, experimental results show that the parallelized NMF-based collaborative filtering on Spark outperforms traditional user-based and item-based CF, with higher processing speed and higher recommendation accuracy.
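For readers unfamiliar with NMF itself, the following is a minimal single-threaded sketch, not the GPU or Spark implementation described above. The abstract does not state which NMF update rule is used; this sketch assumes the classic Lee and Seung multiplicative updates for the Frobenius-norm objective and treats the input as a fully observed dense nonnegative matrix. All names are hypothetical.

```python
import math
import random

def matmul(A, B):
    """Dense matrix product of two lists-of-rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def frob_error(V, W, H):
    """Frobenius norm of the residual V - WH."""
    WH = matmul(W, H)
    return math.sqrt(sum((v - x) * (v - x) for rv, rx in zip(V, WH) for v, x in zip(rv, rx)))

def nmf(V, rank, iters=200, seed=0, eps=1e-9):
    """Factor a nonnegative n x m matrix V into W (n x rank) and H (rank x m)."""
    rng = random.Random(seed)
    n, m = len(V), len(V[0])
    W = [[rng.random() + eps for _ in range(rank)] for _ in range(n)]
    H = [[rng.random() + eps for _ in range(m)] for _ in range(rank)]
    for _ in range(iters):
        # H <- H * (W^T V) / (W^T W H), elementwise
        Wt = transpose(W)
        num, den = matmul(Wt, V), matmul(matmul(Wt, W), H)
        H = [[h * a / (b + eps) for h, a, b in zip(rh, ra, rb)]
             for rh, ra, rb in zip(H, num, den)]
        # W <- W * (V H^T) / (W H H^T), elementwise
        Ht = transpose(H)
        num, den = matmul(V, Ht), matmul(W, matmul(H, Ht))
        W = [[w * a / (b + eps) for w, a, b in zip(rw, ra, rb)]
             for rw, ra, rb in zip(W, num, den)]
    return W, H
```

Because each update multiplies by a ratio of nonnegative quantities, the factors stay nonnegative without any projection step. The matrix products in each iteration dominate the cost, and they are the natural candidates for parallel execution.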
