Gaussian Mixture Model Clustering with Incomplete Data

Author(s):  
Yi Zhang ◽  
Miaomiao Li ◽  
Siwei Wang ◽  
Sisi Dai ◽  
Lei Luo ◽  
...  

Gaussian mixture model (GMM) clustering has been extensively studied due to its effectiveness and efficiency. Though demonstrating promising performance in various applications, it cannot effectively address the absent features among data, which is not uncommon in practical applications. In this article, different from existing approaches that first impute the absence and then perform GMM clustering tasks on the imputed data, we propose to integrate the imputation and GMM clustering into a unified learning procedure. Specifically, the missing data is filled by the result of GMM clustering, and the imputed data is then taken for GMM clustering. These two steps alternatively negotiate with each other to achieve optimum. By this way, the imputed data can best serve for GMM clustering. A two-step alternative algorithm with proved convergence is carefully designed to solve the resultant optimization problem. Extensive experiments have been conducted on eight UCI benchmark datasets, and the results have validated the effectiveness of the proposed algorithm.

2019 ◽  
Vol 178 ◽  
pp. 84-97 ◽  
Author(s):  
Wenzhen Jia ◽  
Yanyan Tan ◽  
Li Liu ◽  
Jing Li ◽  
Huaxiang Zhang ◽  
...  

Author(s):  
Delshad Fakoor ◽  
Vafa Maihami ◽  
Reza Maihami

Changing and moving toward online shopping has made it necessary to customize customers’ needs and provide them more selective options. The buyers search the products’ features before deciding to purchase items. The recommender systems facilitate the searching task for customers via narrowing down the search space within the specific products that align the customer needs. Clustering, as a typical machine learning approach, is applied in recommender systems. As an information filtering method, a recommender system clusters user’s data to indicate the required factors for more accurate predictions by calculating the similarity between members of a cluster. In this study, using the Gaussian mixture model clustering and considering the scores distance and the value of scores in the Pearson correlation coefficient, a new method is introduced for predicting scores in machine learning recommender systems. To study the proposed method’s performance, a Movie Lens data set is evaluated, and the results are compared to some other recommender systems, including the Pearson correlation coefficients similarity criteria, K-means, and fuzzy C-means algorithms. The simulation results indicate that our method has less error than others by increasing the number of neighbors. The results also illustrate that when the number of users increases, the proposed method’s accuracy will increase. The reason is that the Gaussian mixture clustering chooses similar users and considers the scores distance in choosing similar neighbors to the active user.


2021 ◽  
Vol 13 (2) ◽  
pp. 37
Author(s):  
FengLei Yang ◽  
Fei Liu ◽  
ShanShan Liu

Collaborative filtering (CF) is a widely used method in recommendation systems. Linear models are still the mainstream of collaborative filtering research methods, but non-linear probabilistic models are beyond the limit of linear model capacity. For example, variational autoencoders (VAEs) have been extensively used in CF, and have achieved excellent results. Aiming at the problem of the prior distribution for the latent codes of VAEs in traditional CF is too simple, which makes the implicit variable representations of users and items too poor. This paper proposes a variational autoencoder that uses a Gaussian mixture model for latent factors distribution for CF, GVAE-CF. On this basis, an optimization function suitable for GVAE-CF is proposed. In our experimental evaluation, we show that the recommendation performance of GVAE-CF outperforms the previously proposed VAE-based models on several popular benchmark datasets in terms of recall and normalized discounted cumulative gain (NDCG), thus proving the effectiveness of the algorithm.


Sign in / Sign up

Export Citation Format

Share Document