A Dual-Attention Autoencoder Network for Efficient Recommendation System

Electronics ◽  
2021 ◽  
Vol 10 (13) ◽  
pp. 1581
Author(s):  
Chao Duan ◽  
Jianwen Sun ◽  
Kaiqi Li ◽  
Qing Li

Accelerated development of mobile networks and applications leads to the exponential expansion of resources, which causes problems such as information disorientation and information overload. One practical approach to easing these problems is the recommendation system (RS), which can provide individualized service. Video recommendation is one of the most critical recommendation services; however, achieving satisfactory recommendation quality on sparse data is difficult, and the cold start problem further exacerbates the research challenge. Recent state-of-the-art works have attempted to solve this problem by utilizing user and item information from other perspectives. However, the significance of user and item information changes across applications. This paper proposes an autoencoder model that improves recommendation efficiency by utilizing attribute information, and implements the proposed algorithm for video recommendation. In the proposed model, we first extract the user features and the video features by combining the user attribute and the video category information simultaneously. Then, we apply an attention mechanism to the extracted features to generate the vital features. Finally, we incorporate the user and item latent factors to generate the probability matrix and define the user-item rating matrix using the factorized probability matrix. Experimental results on two shared datasets demonstrate that the proposed model can effectively improve video recommendation quality compared with state-of-the-art methods.
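To make the attribute-aware feature extraction concrete, the sketch below shows one plausible way (assumed, not taken from the paper) to fuse a user's partial rating vector with attribute or category side information through an attention gate before autoencoding; all layer names and sizes are illustrative.

```python
# A minimal sketch (PyTorch) of fusing rating vectors with side information
# (user attributes / video categories) through an attention gate before an
# autoencoder; layer sizes and names are illustrative assumptions.
import torch
import torch.nn as nn

class AttentiveAutoencoder(nn.Module):
    def __init__(self, rating_dim, side_dim, hidden_dim=64):
        super().__init__()
        self.rating_proj = nn.Linear(rating_dim, hidden_dim)
        self.side_proj = nn.Linear(side_dim, hidden_dim)
        self.attn = nn.Sequential(nn.Linear(2 * hidden_dim, 2), nn.Softmax(dim=-1))
        self.encoder = nn.Sequential(nn.Linear(hidden_dim, hidden_dim // 2), nn.ReLU())
        self.decoder = nn.Linear(hidden_dim // 2, rating_dim)

    def forward(self, ratings, side_info):
        r = torch.relu(self.rating_proj(ratings))      # partial-rating view
        s = torch.relu(self.side_proj(side_info))      # attribute/category view
        w = self.attn(torch.cat([r, s], dim=-1))       # attention over the two views
        fused = w[..., 0:1] * r + w[..., 1:2] * s      # weighted "vital" features
        z = self.encoder(fused)                        # latent factor
        return self.decoder(z), z                      # reconstructed ratings + latent code

# usage: predictions for unobserved entries come from the reconstruction
model = AttentiveAutoencoder(rating_dim=1000, side_dim=20)
recon, latent = model(torch.rand(8, 1000), torch.rand(8, 20))
```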

2021 ◽  
Vol 11 (24) ◽  
pp. 12119
Author(s):  
Ninghua Sun ◽  
Tao Chen ◽  
Wenshan Guo ◽  
Longya Ran

Information overload on e-government websites has been a major obstacle for users making decisions. One promising approach to solving this problem is to deploy an intelligent recommendation system on e-government platforms. Collaborative filtering (CF) has shown its superiority by characterizing both items and users through latent features inferred from the user–item interaction matrix. A fundamental challenge is to enhance the expression of the user and/or item latent embedding features from implicit feedback; this problem negatively affects the performance of recommendation systems in e-government. In this paper, we first propose to learn positive items' latent features by leveraging both negative item information and the original embedding features. We present the negative items mixed collaborative filtering (NMCF) method to enhance the CF-based recommender system. Such mixed information is beneficial for extending the expressiveness of the latent features. Comprehensive experiments on a real-world e-government dataset showed that our approach improves performance significantly compared with state-of-the-art baseline algorithms.
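As a rough illustration of the idea of mixing negative-item information into positive-item embeddings, the snippet below interpolates a positive item's latent vector with an aggregate of sampled negatives before the usual dot-product scoring; the mixing rule and coefficient are assumptions for illustration, not the authors' NMCF formulation.

```python
# Minimal sketch of "negative items mixed" embeddings for implicit-feedback CF.
# The convex-combination mixing rule with coefficient lam is an assumption used
# only to illustrate the idea, not the exact NMCF method.
import torch

def mix_positive_with_negatives(pos_emb, neg_embs, lam=0.8):
    """pos_emb: (d,) positive item embedding; neg_embs: (k, d) sampled negatives."""
    neg_summary = neg_embs.mean(dim=0)                 # aggregate negative information
    return lam * pos_emb + (1.0 - lam) * neg_summary   # enriched positive embedding

# scoring a user against the mixed item representation (dot-product CF)
user = torch.rand(32)
pos = torch.rand(32)
negs = torch.rand(4, 32)
score = torch.dot(user, mix_positive_with_negatives(pos, negs))
```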


Symmetry ◽  
2020 ◽  
Vol 12 (3) ◽  
pp. 358
Author(s):  
Miftah Bedru Jamal ◽  
Jiang Zhengang ◽  
Fang Ming

Person re-identification is the task of matching pedestrian images across a network of non-overlapping camera views. It poses aggregated challenges resulting from arbitrary human poses, background clutter, illumination variations, and other factors. There have been a vast number of studies in recent years with promising success. However, key challenges have not been adequately addressed and continue to result in sub-optimal performance. Attention-based person re-identification has gained popularity for identifying discriminative features from person images, but its potential for extracting features common to a pair of person images across the feature extraction pipeline has not been fully exploited. In this paper, we propose a novel attention-based Siamese network driven by a mutual-attention module decomposed into spatial and channel components. The proposed mutual-attention module not only guides feature extraction toward the discriminative parts of individual images, but also fuses mutual features symmetrically across pairs of person images to capture informative regions common to both inputs. Our model simultaneously learns feature embeddings for discriminative cues and the similarity measure. The proposed model is optimized with a multi-task loss, namely classification and verification losses, and is further optimized by the learnable mutual-attention module to facilitate efficient and adaptive learning. The proposed model is thoroughly evaluated on the extensively used large-scale datasets Market-1501 and DukeMTMC-ReID. Our experimental results are competitive with state-of-the-art works and demonstrate the effectiveness of the mutual-attention module.
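A minimal sketch of how a mutual-attention block split into channel and spatial components might operate on a pair of Siamese feature maps is given below; the pooling and fusion choices are illustrative assumptions rather than the paper's exact design.

```python
# A minimal sketch (PyTorch) of a mutual-attention block with channel and
# spatial components applied symmetrically to a pair of Siamese feature maps.
import torch
import torch.nn as nn

class MutualAttention(nn.Module):
    def __init__(self, channels, reduction=16):
        super().__init__()
        self.channel_mlp = nn.Sequential(
            nn.Linear(channels, channels // reduction), nn.ReLU(),
            nn.Linear(channels // reduction, channels), nn.Sigmoid())
        self.spatial_conv = nn.Sequential(
            nn.Conv2d(2, 1, kernel_size=7, padding=3), nn.Sigmoid())

    def forward(self, fa, fb):                    # fa, fb: (B, C, H, W) from the two branches
        # channel attention from the joint descriptor, shared by both inputs
        joint = fa.mean(dim=(2, 3)) + fb.mean(dim=(2, 3))         # (B, C)
        cw = self.channel_mlp(joint).unsqueeze(-1).unsqueeze(-1)  # (B, C, 1, 1)
        fa, fb = fa * cw, fb * cw
        # spatial attention from the averaged maps, applied symmetrically
        joint_map = 0.5 * (fa + fb)
        sw = self.spatial_conv(torch.cat([joint_map.mean(1, keepdim=True),
                                          joint_map.amax(1, keepdim=True)], dim=1))
        return fa * sw, fb * sw

attn = MutualAttention(channels=256)
a, b = attn(torch.rand(2, 256, 24, 8), torch.rand(2, 256, 24, 8))
```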


2021 ◽  
pp. 1-16
Author(s):  
Ibtissem Gasmi ◽  
Mohamed Walid Azizi ◽  
Hassina Seridi-Bouchelaghem ◽  
Nabiha Azizi ◽  
Samir Brahim Belhaouari

A Context-Aware Recommender System (CARS) suggests more relevant services by adapting them to the user's specific context. Nevertheless, using many contextual factors can increase data sparsity, while too few context parameters fail to introduce contextual effects into recommendations. Moreover, several CARSs are based on similarity measures, such as the cosine and Pearson correlation coefficients, which are not very effective on sparse datasets. This paper presents a context-aware model that integrates contextual factors into the prediction process when there are insufficient co-rated items. The proposed algorithm uses Latent Dirichlet Allocation (LDA) to learn the latent interests of users from the textual descriptions of items. It then integrates both the explicit contextual factors and their degree of importance into the prediction process by introducing a weighting function, and the Particle Swarm Optimization (PSO) algorithm is employed to learn and optimize the weights of these features. Results on the MovieLens 1M dataset show that the proposed model achieves an F-measure of 45.51% with a precision of 68.64%. Furthermore, the improvements in MAE and RMSE reach 41.63% and 39.69%, respectively, compared with state-of-the-art techniques.
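The sketch below illustrates the two ingredients described above: LDA topics learned from item descriptions and a context-weighted prediction whose weights a PSO search would tune. The feature names, the weighting form, and all numbers are assumptions made purely for illustration.

```python
# Minimal sketch: LDA topics from item descriptions plus a context-weighted
# rating prediction. Weight values are placeholders that a PSO run would tune;
# the weighting form is an illustrative assumption, not the paper's function.
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

descriptions = ["animated family adventure", "dark crime thriller", "romantic comedy in paris"]
X = CountVectorizer().fit_transform(descriptions)
topics = LatentDirichletAllocation(n_components=2, random_state=0).fit_transform(X)  # item interests

def predict(base_pred, context_match, weights):
    """Blend a base rating prediction with per-factor context agreement scores.

    context_match: values in [0, 1] for factors such as time/companion/mood;
    weights: importance of each factor (to be optimized, e.g. by PSO)."""
    return base_pred * (1.0 + np.dot(weights, context_match - 0.5))

rating = predict(base_pred=3.8, context_match=np.array([1.0, 0.0, 1.0]),
                 weights=np.array([0.2, 0.1, 0.3]))
```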


2021 ◽  
Vol 11 (8) ◽  
pp. 3636
Author(s):  
Faria Zarin Subah ◽  
Kaushik Deb ◽  
Pranab Kumar Dhar ◽  
Takeshi Koshiba

Autism spectrum disorder (ASD) is a complex neuro-developmental disorder. Most existing methods utilize functional magnetic resonance imaging (fMRI) to detect ASD with very limited datasets, which provides high accuracy but results in poor generalization. To overcome this limitation and to enhance the performance of the automated autism diagnosis model, in this paper we propose an ASD detection model using functional connectivity features of resting-state fMRI data. Our proposed model utilizes two commonly used brain atlases, Craddock 200 (CC200) and Automated Anatomical Labelling (AAL), and two rarely used atlases, Bootstrap Analysis of Stable Clusters (BASC) and Power. A deep neural network (DNN) classifier is used to perform the classification task. Simulation results indicate that the proposed model outperforms state-of-the-art methods in terms of accuracy. The mean accuracy of the proposed model was 88%, whereas the mean accuracy of the state-of-the-art methods ranged from 67% to 85%. The sensitivity, F1-score, and area under the receiver operating characteristic curve (AUC) of the proposed model were 90%, 87%, and 96%, respectively. Comparative analysis of various scoring strategies shows the superiority of the BASC atlas over the other aforementioned atlases in classifying ASD and control subjects.
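A minimal sketch of the functional-connectivity pipeline is shown below: Pearson correlations between atlas ROI time series are flattened into a feature vector and fed to a small DNN. The data here is synthetic, and atlas-based ROI extraction (CC200/AAL/BASC/Power) is assumed to have been done upstream.

```python
# Sketch of functional-connectivity features: Pearson correlations between
# atlas ROI time series form the feature vector fed to a small DNN classifier.
# Synthetic data only; not the authors' preprocessing or network configuration.
import numpy as np
from sklearn.neural_network import MLPClassifier

def connectivity_features(roi_timeseries):
    """roi_timeseries: (timepoints, n_rois) -> upper-triangle correlation vector."""
    corr = np.corrcoef(roi_timeseries.T)         # (n_rois, n_rois) Pearson matrix
    iu = np.triu_indices_from(corr, k=1)
    return corr[iu]

rng = np.random.default_rng(0)
X = np.stack([connectivity_features(rng.normal(size=(120, 200))) for _ in range(40)])
y = rng.integers(0, 2, size=40)                  # ASD vs. control labels (synthetic)

clf = MLPClassifier(hidden_layer_sizes=(256, 64), max_iter=300, random_state=0).fit(X, y)
print(clf.score(X, y))
```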


Author(s):  
Masoumeh Zareapoor ◽  
Jie Yang

Image-to-image translation aims to learn a mapping from a source domain to a target domain. However, three main challenges are associated with this task and need to be addressed: the lack of paired datasets, multimodality, and diversity. Convolutional neural networks (CNNs), despite their strong performance in many computer vision tasks, fail to capture the hierarchy of spatial relationships between different parts of an object and thus do not form the ideal representative model we seek. This article presents a new variation of generative models that aims to remedy this problem. We use a trainable transformer, which explicitly allows spatial manipulation of the data during training. This differentiable module can be inserted into the convolutional layers of the generative model, allowing the generated distributions to be freely altered for image-to-image translation. To reap the benefits of the proposed module, our architecture incorporates a new loss function that facilitates effective end-to-end generative learning for image-to-image translation. The proposed model is evaluated through comprehensive experiments on image synthesis and image-to-image translation, along with comparisons with several state-of-the-art algorithms.
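The description of a trainable, differentiable module for spatial manipulation matches the general pattern of a spatial-transformer block; the sketch below shows such a block in PyTorch as an assumed stand-in for how it could sit between the generator's convolutional layers, not as the authors' architecture.

```python
# A minimal sketch (PyTorch) of a differentiable spatial-transformer block that
# can be dropped between convolutional layers of a generator; the localization
# network is deliberately small and its sizes are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SpatialTransformer(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.loc = nn.Sequential(
            nn.AdaptiveAvgPool2d(8), nn.Flatten(),
            nn.Linear(channels * 64, 32), nn.ReLU(),
            nn.Linear(32, 6))                     # 2x3 affine parameters
        # initialize to the identity transform so training starts undistorted
        self.loc[-1].weight.data.zero_()
        self.loc[-1].bias.data.copy_(torch.tensor([1, 0, 0, 0, 1, 0], dtype=torch.float))

    def forward(self, x):
        theta = self.loc(x).view(-1, 2, 3)
        grid = F.affine_grid(theta, x.size(), align_corners=False)
        return F.grid_sample(x, grid, align_corners=False)   # spatially warped feature map

stn = SpatialTransformer(channels=64)
warped = stn(torch.rand(4, 64, 32, 32))
```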


2018 ◽  
Vol 2018 ◽  
pp. 1-7
Author(s):  
A. B. Vallejo-Mora ◽  
M. Toril ◽  
S. Luna-Ramírez ◽  
M. Regueira ◽  
S. Pedraza

UpLink Power Control (ULPC) is a key radio resource management procedure in mobile networks. In this paper, an analytical model for estimating the impact of increasing the nominal power parameter in the ULPC algorithm for the Physical Uplink Shared CHannel (PUSCH) in Long Term Evolution (LTE) is presented. The aim of the model is to predict the effect of changing the nominal power parameter in a cell on the interference and Signal-to-Interference-plus-Noise Ratio (SINR) of that cell and its neighbors from network statistics. Model assessment is carried out by means of a field trial where the nominal power parameter is increased in some cells of a live LTE network. Results show that the proposed model achieves reasonable estimation accuracy, provided uplink traffic does not change significantly.
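For reference, the open-loop PUSCH power-control rule from 3GPP TS 36.213, in which the studied nominal power parameter appears, is sketched below; the numeric values in the example calls are illustrative only, and the UE-specific offset of the full specification is folded into the single P0 argument.

```python
# Standard LTE PUSCH power-control rule (3GPP TS 36.213) showing where the
# nominal power parameter P0 enters; numbers in the example are illustrative.
import math

def pusch_tx_power_dbm(p_cmax, n_prb, p0_nominal, alpha, pathloss_db,
                       delta_tf=0.0, f_closed_loop=0.0):
    """Per-subframe PUSCH transmit power in dBm.

    p0_nominal: nominal power parameter (dBm) whose increase the paper models;
    alpha: fractional pathloss-compensation factor; n_prb: allocated PRBs."""
    open_loop = 10 * math.log10(n_prb) + p0_nominal + alpha * pathloss_db
    return min(p_cmax, open_loop + delta_tf + f_closed_loop)

# raising P0 by 3 dB raises uplink power (and potential neighbor interference) until capped
print(pusch_tx_power_dbm(p_cmax=23, n_prb=10, p0_nominal=-103, alpha=0.8, pathloss_db=120))
print(pusch_tx_power_dbm(p_cmax=23, n_prb=10, p0_nominal=-100, alpha=0.8, pathloss_db=120))
```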


Author(s):  
Yinfei Yang ◽  
Gustavo Hernandez Abrego ◽  
Steve Yuan ◽  
Mandy Guo ◽  
Qinlan Shen ◽  
...  

In this paper, we present an approach to learn multilingual sentence embeddings using a bi-directional dual-encoder with additive margin softmax. The embeddings are able to achieve state-of-the-art results on the United Nations (UN) parallel corpus retrieval task. In all the languages tested, the system achieves P@1 of 86% or higher. We use pairs retrieved by our approach to train NMT models that achieve similar performance to models trained on gold pairs. We explore simple document-level embeddings constructed by averaging our sentence embeddings. On the UN document-level retrieval task, document embeddings achieve around 97% on P@1 for all experimented language pairs. Lastly, we evaluate the proposed model on the BUCC mining task. The learned embeddings with raw cosine similarity scores achieve competitive results compared to current state-of-the-art models, and with a second-stage scorer we achieve a new state-of-the-art level on this task.
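A minimal sketch of a bidirectional additive-margin softmax loss over a batch of paired sentence embeddings is given below; the margin and scale values are illustrative assumptions, and the encoder producing the embeddings is assumed to exist upstream.

```python
# Sketch (PyTorch) of the bidirectional additive-margin softmax objective for
# paired sentence embeddings; margin and scale values are illustrative.
import torch
import torch.nn.functional as F

def bidirectional_am_softmax(src_emb, tgt_emb, margin=0.3, scale=20.0):
    """src_emb, tgt_emb: (B, d) embeddings where row i of each side is a translation pair."""
    src = F.normalize(src_emb, dim=-1)
    tgt = F.normalize(tgt_emb, dim=-1)
    sim = scale * (src @ tgt.t())                        # scaled cosine similarity matrix
    sim = sim - scale * margin * torch.eye(sim.size(0))  # subtract margin from true pairs
    labels = torch.arange(sim.size(0))
    # source->target and target->source retrieval losses, averaged
    return 0.5 * (F.cross_entropy(sim, labels) + F.cross_entropy(sim.t(), labels))

loss = bidirectional_am_softmax(torch.randn(16, 512), torch.randn(16, 512))
print(loss.item())
```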


Author(s):  
Kaixuan Chen ◽  
Lina Yao ◽  
Dalin Zhang ◽  
Bin Guo ◽  
Zhiwen Yu

Multi-modality is an important feature of sensor-based activity recognition. In this work, we consider two inherent characteristics of human activities: the spatially and temporally varying salience of features, and the relations between activities and the corresponding body-part motions. Based on these, we propose a multi-agent spatial-temporal attention model. The spatial-temporal attention mechanism helps intelligently select informative modalities and their active periods, and the multiple agents in the proposed model represent activities as collective motions across body parts by independently selecting the modalities associated with single motions. With a joint recognition goal, the agents share gained information and coordinate their selection policies to learn the optimal recognition model. Experimental results on four real-world datasets demonstrate that the proposed model outperforms state-of-the-art methods.
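The sketch below illustrates spatial (modality) and temporal attention over multi-modal sensor windows; in the full model each agent would hold its own such selector and the agents would share a joint recognition loss. Shapes, layer sizes, and the class count are illustrative assumptions.

```python
# Minimal sketch (PyTorch) of spatial (modality) and temporal attention over a
# window of multi-modal sensor readings; a simplified single-agent stand-in.
import torch
import torch.nn as nn

class SpatialTemporalAttention(nn.Module):
    def __init__(self, n_modalities, n_classes=6):
        super().__init__()
        self.modality_score = nn.Linear(n_modalities, n_modalities)  # which sensors matter
        self.time_score = nn.Linear(n_modalities, 1)                 # which moments matter
        self.classifier = nn.Linear(n_modalities, n_classes)

    def forward(self, x):                         # x: (B, T, M) sensor readings
        m_w = torch.softmax(self.modality_score(x.mean(dim=1)), dim=-1)   # (B, M)
        x = x * m_w.unsqueeze(1)                                          # weight modalities
        t_w = torch.softmax(self.time_score(x), dim=1)                    # (B, T, 1)
        pooled = (x * t_w).sum(dim=1)                                     # attention-pooled window
        return self.classifier(pooled)

model = SpatialTemporalAttention(n_modalities=9)
logits = model(torch.rand(4, 128, 9))
```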


2020 ◽  
Vol 34 (04) ◽  
pp. 6470-6477
Author(s):  
Canran Xu ◽  
Ming Wu

Learning representations of feature interactions to model user behaviors is critical for recommendation systems and click-through rate (CTR) prediction. Recent advances in this area are empowered by deep learning methods, which can learn sophisticated feature interactions and achieve state-of-the-art results in an end-to-end manner. These approaches require a large number of training parameters on top of the low-level representations and are thus memory- and computation-inefficient. In this paper, we propose a new model named "LorentzFM" that learns feature interactions embedded in a hyperbolic space, in which the violation of the triangle inequality for Lorentz distances can be exploited to score interactions. To this end, the learned representation benefits from the peculiar geometric properties of hyperbolic triangles, and the number of parameters is reduced significantly (by 20% to 80%) because no top deep learning layers are required. With such a lightweight architecture, LorentzFM achieves results comparable to, and in some cases materially better than, deep learning methods such as DeepFM, xDeepFM and Deep & Cross on both recommendation and CTR prediction tasks.
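The building blocks behind Lorentz-space interactions are sketched below: embeddings lifted onto the hyperboloid, the Lorentzian inner product, and the squared Lorentz distance. The final pairwise aggregation shown is a simplified stand-in for LorentzFM's triangle-inequality-violation score, not the paper's exact formula.

```python
# Building blocks of Lorentz-space feature interactions. The last aggregation
# (summing a distance-based pair score) is an illustrative stand-in, not
# LorentzFM's exact triangle-inequality-violation score.
import torch

def lift_to_hyperboloid(x):
    """x: (..., d) Euclidean parameters -> (..., d+1) points with <p, p>_L = -1."""
    x0 = torch.sqrt(1.0 + (x * x).sum(dim=-1, keepdim=True))
    return torch.cat([x0, x], dim=-1)

def lorentz_inner(u, v):
    return -u[..., 0] * v[..., 0] + (u[..., 1:] * v[..., 1:]).sum(dim=-1)

def squared_lorentz_distance(u, v):
    return -2.0 - 2.0 * lorentz_inner(u, v)

# toy interaction score over all feature pairs of one sample (no deep layers on top)
feats = lift_to_hyperboloid(torch.randn(10, 8))          # 10 feature embeddings, dim 8
i, j = torch.triu_indices(10, 10, offset=1)
pair_scores = -squared_lorentz_distance(feats[i], feats[j])
print(pair_scores.sum())
```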

