IMPROVING MATRIX FACTORIZATION-BASED RECOMMENDER VIA ENSEMBLE METHODS

2011 ◽  
Vol 10 (03) ◽  
pp. 539-561 ◽  
Author(s):  
XIN LUO ◽  
YUANXIN OUYANG ◽  
XIONG ZHANG

One of the most popular approaches to Collaborative Filtering is based on Matrix Factorization (MF). In this paper, we focus on improving MF-based recommender's accuracy by homogeneous ensemble methods. To build such ensembles, we investigate a series of methods primarily in two aspects: (i) manipulating the training examples, including bagging, AdaBoost, and Forward Stepwise Additive Regression; (ii) injecting randomness to the base models' training settings, including randomizing the initializing parameters and randomizing the training sequences. Each method is evaluated on two large, real datasets, and then the effective methods are combined to form a cascade MF ensemble scheme. The validation results on experiment datasets demonstrate that compared to a single MF-based recommender, our ensemble scheme could obtain a significant improvement in the prediction accuracy.

2019 ◽  
Vol 9 (20) ◽  
pp. 4378 ◽  
Author(s):  
Yuan ◽  
Zahir ◽  
Yang

Recommendation systems often use side information to both alleviate problems, such as the cold start problem and data sparsity, and increase prediction accuracy. One such piece of side information, which has been widely investigated in addressing such challenges, is trust. However, the difficulty in obtaining explicit relationship data has led researchers to infer trust values from other means such as the user-to-item relationship. This paper proposes a model to improve prediction accuracy by applying the trust relationship between the user and item ratings. Two approaches to implement trust into prediction are proposed: one involves the use of estimated trust, and the other involves the initial trust. The efficiency of the proposed method is verified by comparing the obtained results with four well-known methods, including the state-of-the-art deep learning-based method of neural graph collaborative filtering (NGCF). The experimental results demonstrate that the proposed method performs significantly better than the NGCF, and the three other matrix factorization methods, namely, the singular value decomposition (SVD), SVD++, and the social matrix factorization (SocialMF).


2020 ◽  
Vol 10 (16) ◽  
pp. 5601
Author(s):  
Juan A. Gómez-Pulido ◽  
Arturo Durán-Domínguez ◽  
Francisco Pajuelo-Holguera

The problem of predicting students’ performance has been recently tackled by using matrix factorization, a popular method applied for collaborative filtering based recommender systems. This problem consists of predicting the unknown performance or score of a particular student for a task s/he did not complete or did not attend, according to the scores of the tasks s/he did complete and the scores of the colleagues who completed the task in question. The solving method considers matrix factorization and a gradient descent algorithm in order to build a prediction model that minimizes the error in the prediction of test data. However, we identified two key aspects that influence the accuracy of the prediction. On the one hand, the model involves a pair of important parameters: the learning rate and the regularization factor, for which there are no fixed values for any experimental case. On the other hand, the datasets are extracted from virtual classrooms on online campuses and have a number of implicit latent factors. The right figures are difficult to ascertain, as they depend on the nature of the dataset: subject, size, type of learning, academic environment, etc. This paper proposes some approaches to improve the prediction accuracy by optimizing the values of the latent factors, learning rate, and regularization factor. To this end, we apply optimization algorithms that cover a wide search space. The experimental results obtained from real-world datasets improved the prediction accuracy in the context of a thorough search for predefined values. Obtaining optimized values of these parameters allows us to apply them to further predictions for similar datasets.


Author(s):  
Xinyue Liu ◽  
Chara Aggarwal ◽  
Yu-Feng Li ◽  
Xiaugnan Kong ◽  
Xinyuan Sun ◽  
...  

2021 ◽  
Vol 2021 ◽  
pp. 1-15
Author(s):  
Bing Tang ◽  
Linyao Kang ◽  
Li Zhang ◽  
Feiyan Guo ◽  
Haiwu He

Nonnegative matrix factorization (NMF) has been introduced as an efficient way to reduce the complexity of data compression and its capability of extracting highly interpretable parts from data sets, and it has also been applied to various fields, such as recommendations, image analysis, and text clustering. However, as the size of the matrix increases, the processing speed of nonnegative matrix factorization is very slow. To solve this problem, this paper proposes a parallel algorithm based on GPU for NMF in Spark platform, which makes full use of the advantages of in-memory computation mode and GPU acceleration. The new GPU-accelerated NMF on Spark platform is evaluated in a 4-node Spark heterogeneous cluster using Google Compute Engine by configuring each node a NVIDIA K80 CUDA device, and experimental results indicate that it is competitive in terms of computational time against the existing solutions on a variety of matrix orders. Furthermore, a GPU-accelerated NMF-based parallel collaborative filtering (CF) algorithm is also proposed, utilizing the advantages of data dimensionality reduction and feature extraction of NMF, as well as the multicore parallel computing mode of CUDA. Using real MovieLens data sets, experimental results have shown that the parallelization of NMF-based collaborative filtering on Spark platform effectively outperforms traditional user-based and item-based CF with a higher processing speed and higher recommendation accuracy.


2017 ◽  
Vol 86 ◽  
pp. 62-67 ◽  
Author(s):  
Vikas Kumar ◽  
Arun K. Pujari ◽  
Sandeep Kumar Sahu ◽  
Venkateswara Rao Kagita ◽  
Vineet Padmanabhan

Author(s):  
Ariel Bar ◽  
Lior Rokach ◽  
Guy Shani ◽  
Bracha Shapira ◽  
Alon Schclar

Sign in / Sign up

Export Citation Format

Share Document