A recommendations model with multiaspect awareness and hierarchical user-product attention mechanisms

Computer Science and Information Systems ◽

10.2298/csis190925024b ◽

2020 ◽

Vol 17 (3) ◽

pp. 849-865

Author(s):

Zhongqin Bi ◽

Shuming Dou ◽

Zhe Liu ◽

Yongbin Li

Keyword(s):

State Of The Art ◽

Weight Vector ◽

User Preferences ◽

The Other ◽

Attention Networks ◽

Proposed Model ◽

Network Methods ◽

Public Datasets ◽

Novel Model ◽

Attention Weight

Neural network methods have been trained to satisfactorily learn user/product representations from textual reviews. A representation can be considered as a multiaspect attention weight vector. However, in several existing methods, it is assumed that the user representation remains unchanged even when the user interacts with products having diverse characteristics, which leads to inaccurate recommendations. To overcome this limitation, this paper proposes a novel model to capture the varying attention of a user for different products by using a multilayer attention framework. First, two individual hierarchical attention networks are used to encode the users and products to learn the user preferences and product characteristics from review texts. Then, we design an attention network to reflect the adaptive change in the user preferences for each aspect of the targeted product in terms of the rating and review. The results of experiments performed on three public datasets demonstrate that the proposed model notably outperforms the other state-of-the-art baselines, thereby validating the effectiveness of the proposed approach.
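The idea that a user's attention over aspects should change with the target product can be illustrated with a small sketch. This is not the paper's architecture: the dot-product scoring, the function name `product_aware_user_vector`, and the toy vectors are all assumptions chosen for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def product_aware_user_vector(user_aspects, product_vec):
    """Pool a user's aspect vectors with weights that depend on the product.

    user_aspects: list of aspect vectors (hypothetical, e.g. learned from reviews).
    product_vec:  vector describing the target product (hypothetical).
    Scoring by dot product is an assumption, not the method in the paper.
    """
    scores = [sum(a * p for a, p in zip(aspect, product_vec))
              for aspect in user_aspects]
    weights = softmax(scores)
    dim = len(product_vec)
    pooled = [sum(w * aspect[d] for w, aspect in zip(weights, user_aspects))
              for d in range(dim)]
    return weights, pooled


# The same user, two different products: the attention weights shift.
user_aspects = [[1.0, 0.0], [0.0, 1.0]]
weights_a, _ = product_aware_user_vector(user_aspects, [2.0, 0.0])
weights_b, _ = product_aware_user_vector(user_aspects, [0.0, 2.0])
```

In this toy setting the first aspect dominates for the first product and the second aspect dominates for the second, which is the behavior a fixed user representation cannot express.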


A Novel Model for Imbalanced Data Classification

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.6145 ◽

2020 ◽

Vol 34 (04) ◽

pp. 6680-6687

Author(s):

Jian Yin ◽

Chunjing Gan ◽

Kaiqi Zhao ◽

Xuan Lin ◽

Zhe Quan ◽

...

Keyword(s):

Imbalanced Data ◽

Data Classification ◽

Classification Performance ◽

Classification Model ◽

Proposed Model ◽

Imbalanced Data Classification ◽

Public Datasets ◽

Distribution Cost ◽

Novel Model ◽

Learning Data

Recently, imbalanced data classification has received much attention due to its wide range of applications. Existing studies have attempted to improve classification performance by considering various factors such as the imbalanced distribution, cost-sensitive learning, data space improvement, and ensemble learning. Nevertheless, most existing methods focus on only some of these factors. In this work, we propose a novel imbalanced data classification model that considers all of them. To evaluate the performance of our proposed model, we conducted experiments on 14 public datasets. The results show that our model outperforms the state-of-the-art methods in terms of recall, G-mean, F-measure, and AUC.
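Three of the metrics named above can be computed from a binary confusion matrix. The sketch below assumes the common definitions (G-mean as the geometric mean of recall and specificity, F-measure as the harmonic mean of precision and recall); the abstract does not state which variants the authors used, and the labels are made-up toy data.

```python
import math


def binary_metrics(y_true, y_pred, positive=1):
    """Recall, specificity, precision, F-measure, and G-mean for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    recall = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    if precision + recall:
        f_measure = 2 * precision * recall / (precision + recall)
    else:
        f_measure = 0.0
    g_mean = math.sqrt(recall * specificity)
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    return {
        "recall": recall,
        "specificity": specificity,
        "precision": precision,
        "f_measure": f_measure,
        "g_mean": g_mean,
        "accuracy": accuracy,
    }


# Toy imbalanced sample: 2 positives, 8 negatives (hypothetical data).
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]
metrics = binary_metrics(y_true, y_pred)
```

On this toy sample accuracy is 0.8 while recall is only 0.5, which shows why accuracy alone is a weak indicator on imbalanced data.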


Learning Personalized End-to-End Goal-Oriented Dialog

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33016794 ◽

2019 ◽

Vol 33 ◽

pp. 6794-6801 ◽

Cited By ~ 6

Author(s):

Liangchen Luo ◽

Wenhao Huang ◽

Qi Zeng ◽

Zaiqing Nie ◽

Xu Sun

Keyword(s):

User Satisfaction ◽

State Of The Art ◽

User Preferences ◽

Task Completion ◽

Dialog Systems ◽

Performance Improvements ◽

Human Evaluation ◽

Proposed Model ◽

End To End ◽

Qualitative Performance

Most existing works on dialog systems only consider conversation content while neglecting the personality of the user the bot is interacting with, which begets several unsolved issues. In this paper, we present a personalized end-to-end model in an attempt to leverage personalization in goal-oriented dialogs. We first introduce a PROFILE MODEL which encodes user profiles into distributed embeddings and refers to conversation history from other similar users. Then a PREFERENCE MODEL captures user preferences over knowledge base entities to handle the ambiguity in user requests. The two models are combined into the PERSONALIZED MEMN2N. Experiments show that the proposed model achieves qualitative performance improvements over state-of-the-art methods. As for human evaluation, it also outperforms other approaches in terms of task completion rate and user satisfaction.


Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/435 ◽

2017 ◽

Cited By ~ 141

Author(s):

Jun Xiao ◽

Hao Ye ◽

Xiangnan He ◽

Hanwang Zhang ◽

Fei Wu ◽

...

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Feature Interaction ◽

Model Parameters ◽

Learning Approach ◽

Attention Networks ◽

Feature Interactions ◽

Factorization Machine ◽

Real World Datasets ◽

Novel Model

Factorization Machines (FMs) are a supervised learning approach that enhances the linear regression model by incorporating second-order feature interactions. Despite its effectiveness, FM can be hindered by modelling all feature interactions with the same weight, as not all feature interactions are equally useful and predictive. For example, interactions with useless features may introduce noise and degrade performance. In this work, we improve FM by discriminating the importance of different feature interactions. We propose a novel model named Attentional Factorization Machine (AFM), which learns the importance of each feature interaction from data via a neural attention network. Extensive experiments on two real-world datasets demonstrate the effectiveness of AFM. Empirically, on a regression task, AFM outperforms FM with an 8.6% relative improvement, and consistently outperforms the state-of-the-art deep learning methods Wide&Deep [Cheng et al., 2016] and DeepCross [Shan et al., 2016] with a much simpler structure and fewer model parameters. Our implementation of AFM is publicly available at: https://github.com/hexiangnan/attentional_factorization_machine
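The mechanism described above, weighting each pairwise feature interaction with a learned attention score, can be sketched in a few lines of plain Python. This is an illustrative forward pass only, not the authors' released implementation: the parameter names (`V`, `W_att`, `b_att`, `h`, `p`), the one-hidden-layer ReLU attention network, and the toy values are assumptions.

```python
import math


def afm_predict(x, w0, w, V, W_att, b_att, h, p):
    """Forward pass of an attention-weighted second-order interaction model.

    x: feature values (length n); w0, w: bias and linear weights;
    V: n embedding vectors of size k; W_att (t x k), b_att (t), h (t):
    attention network parameters; p (k): projection of the pooled vector.
    Returns (prediction, attention_weights).
    """
    n, k, t = len(x), len(p), len(h)
    linear = w0 + sum(w[i] * x[i] for i in range(n))

    # Element-wise product of each embedding pair, scaled by x_i * x_j.
    pairs = []
    for i in range(n):
        for j in range(i + 1, n):
            if x[i] != 0 and x[j] != 0:
                pairs.append([V[i][d] * V[j][d] * x[i] * x[j] for d in range(k)])
    if not pairs:
        return linear, []

    # Score each interaction with a small ReLU network, then softmax.
    scores = []
    for e in pairs:
        hidden = [max(0.0, sum(W_att[r][d] * e[d] for d in range(k)) + b_att[r])
                  for r in range(t)]
        scores.append(sum(h[r] * hidden[r] for r in range(t)))
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    att = [v / total for v in exps]

    # Attention-weighted sum of interactions, projected to a scalar.
    pooled = [sum(att[q] * pairs[q][d] for q in range(len(pairs)))
              for d in range(k)]
    return linear + sum(p[d] * pooled[d] for d in range(k)), att


# Toy parameters (hypothetical). Zero attention weights give uniform attention.
x = [1.0, 1.0, 1.0]
V = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
prediction, attention = afm_predict(
    x, w0=0.0, w=[0.0, 0.0, 0.0], V=V,
    W_att=[[0.0, 0.0]], b_att=[0.0], h=[0.0], p=[1.0, 1.0],
)
```

With all attention parameters at zero, every interaction receives the same weight, which corresponds to the equal weighting that the abstract identifies as FM's limitation; training the attention parameters is what lets the weights differ.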


Where to Go Next: Modeling Long- and Short-Term User Preferences for Point-of-Interest Recommendation

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i01.5353 ◽

2020 ◽

Vol 34 (01) ◽

pp. 214-221 ◽

Cited By ~ 3

Author(s):

Ke Sun ◽

Tieyun Qian ◽

Tong Chen ◽

Yile Liang ◽

Quoc Viet Hung Nguyen ◽

...

Keyword(s):

State Of The Art ◽

User Preferences ◽

Short Term ◽

Preference Modeling ◽

Point Of Interest ◽

Proposed Model ◽

POI Recommendation ◽

Novel Method ◽

Real World Datasets

Point-of-Interest (POI) recommendation has been a trending research topic as it generates personalized suggestions on facilities for users from a large number of candidate venues. Since users' check-in records can be viewed as a long sequence, methods based on recurrent neural networks (RNNs) have recently shown promising applicability for this task. However, existing RNN-based methods either neglect users' long-term preferences or overlook the geographical relations among recently visited POIs when modeling users' short-term preferences, thus making the recommendation results unreliable. To address the above limitations, we propose a novel method named Long- and Short-Term Preference Modeling (LSTPM) for next-POI recommendation. In particular, the proposed model consists of a nonlocal network for long-term preference modeling and a geo-dilated RNN for short-term preference learning. Extensive experiments on two real-world datasets demonstrate that our model yields significant improvements over the state-of-the-art methods.


A Deep Learning Based Approach for Localization and Recognition of Pakistani Vehicle License Plates

Sensors ◽

10.3390/s21227696 ◽

2021 ◽

Vol 21 (22) ◽

pp. 7696

Author(s):

Umair Yousaf ◽

Ahmad Khan ◽

Hazrat Ali ◽

Fiaz Gul Khan ◽

Zia ur Rehman ◽

...

Keyword(s):

Deep Learning ◽

State Of The Art ◽

The Other ◽

License Plate ◽

Standard Size ◽

Current State ◽

Bounding Box ◽

Proposed Model ◽

Plate Area

License plate localization is the process of finding the license plate area and drawing a bounding box around it, while recognition is the process of identifying the text within the bounding box. The current state-of-the-art license plate localization and recognition approaches require license plates of standard size, style, fonts, and colors. Unfortunately, in Pakistan, license plates are non-standard and vary in terms of the characteristics mentioned above. This paper presents a deep-learning-based approach to localize and recognize Pakistani license plates with non-uniform and non-standardized sizes, fonts, and styles. We developed a new Pakistani license plate dataset (PLPD) to train and evaluate the proposed model. We conducted extensive experiments to compare the accuracy of the proposed approach with existing techniques. The results show that the proposed method outperformed the other methods in localizing and recognizing non-standard license plates.


Topic Enhanced Controllable CVAE for Dialogue Generation (Student Abstract)

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i10.7250 ◽

2020 ◽

Vol 34 (10) ◽

pp. 13955-13956

Author(s):

Yiru Wang ◽

Pengda Si ◽

Zeyang Lei ◽

Yujiu Yang

Keyword(s):

Latent Variable ◽

Experimental Results ◽

The Other ◽

Topic Knowledge ◽

Other Hand ◽

Proposed Model ◽

The One ◽

Public Datasets

Neural generation models have shown great potential in conversation generation recently. However, these methods tend to generate uninformative or irrelevant responses. In this paper, we present a novel topic-enhanced controllable CVAE (TEC-CVAE) model to address this issue. On the one hand, the model learns the context-interactive topic knowledge through a novel multi-hop hybrid attention in the encoder. On the other hand, we design a topic-aware controllable decoder to constrain the expression of the stochastic latent variable in the CVAE to reduce irrelevant responses. Experimental results on two public datasets show that the two mechanisms synchronize to improve both relevance and diversity, and the proposed model outperforms other competitive methods.


Latent Opinions Transfer Network for Target-Oriented Opinion Words Extraction

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6469 ◽

2020 ◽

Vol 34 (05) ◽

pp. 9298-9305

Author(s):

Zhen Wu ◽

Fei Zhao ◽

Xin-Yu Dai ◽

Shujian Huang ◽

Jiajun Chen

Keyword(s):

Neural Network ◽

Transfer Process ◽

State Of The Art ◽

Transformation Method ◽

Sentiment Classification ◽

Experimental Results ◽

Online Review ◽

Neural Models ◽

Network Methods ◽

Novel Model

Target-oriented opinion words extraction (TOWE) is a new subtask of aspect-based sentiment analysis (ABSA), which aims to extract the corresponding opinion words for a given opinion target in a sentence. Recently, neural network methods have been applied to this task and have achieved promising results. However, the difficulty of annotation leaves TOWE datasets insufficient, which heavily limits the performance of neural models. By contrast, abundant review sentiment classification data are easily available at online review sites. These reviews contain substantial latent opinion information and semantic patterns. In this paper, we propose a novel model to transfer this opinion knowledge from resource-rich review sentiment classification datasets to the low-resource TOWE task. To address the challenges in the transfer process, we design an effective transformation method to obtain latent opinions, then integrate them into TOWE. Extensive experimental results show that our model achieves better performance than other state-of-the-art methods and significantly outperforms the base model without transferred opinion knowledge. Further analysis validates the effectiveness of our model.


GCAEMDA: Predicting miRNA-disease associations via graph convolutional autoencoder

PLoS Computational Biology ◽

10.1371/journal.pcbi.1009655 ◽

2021 ◽

Vol 17 (12) ◽

pp. e1009655

Author(s):

Lei Li ◽

Yu-Tian Wang ◽

Cun-Mei Ji ◽

Chun-Hou Zheng ◽

Jian-Cheng Ni ◽

...

Keyword(s):

Cross Validation ◽

State Of The Art ◽

Human Diseases ◽

Biological Processes ◽

Proposed Model ◽

Disease Associations ◽

Convolutional Autoencoder ◽

Non Coding RNAs ◽

Novel Model ◽

Better Than

MicroRNAs (miRNAs) are small non-coding RNAs related to a number of complicated biological processes. A growing body of studies has suggested that miRNAs are closely associated with many human diseases. It is meaningful to consider disease-related miRNAs as potential biomarkers, which could greatly contribute to understanding the mechanisms of complex diseases and benefit the prevention, detection, diagnosis and treatment of extraordinary diseases. In this study, we present a novel model named Graph Convolutional Autoencoder for miRNA-Disease Association Prediction (GCAEMDA). In the proposed model, we use miRNA-miRNA similarities, disease-disease similarities and verified miRNA-disease associations to construct a heterogeneous network, which is used to learn the embeddings of miRNAs and diseases. In addition, we separately construct miRNA-based and disease-based sub-networks. Combining the embeddings of miRNAs and diseases, a graph convolutional autoencoder (GCAE) is used to calculate miRNA-disease association scores on each of the two sub-networks. We then obtain the final prediction scores between miRNAs and diseases by averaging the prediction scores from the two types of sub-networks. To assess the accuracy of GCAEMDA, we applied different cross-validation methods to evaluate our model, whose performance was better than that of the state-of-the-art models. Case studies on common human diseases were also conducted to demonstrate the effectiveness of GCAEMDA. The results demonstrate that GCAEMDA is useful for inferring potential miRNA-disease associations.
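The final step, combining the scores from the miRNA-based and disease-based sub-networks by averaging, can be sketched as follows. The function name, the matrix layout (rows as miRNAs, columns as diseases), and the toy scores are assumptions made for illustration; the abstract does not specify them.

```python
def average_ensemble(scores_mirna_view, scores_disease_view):
    """Element-wise mean of two score matrices of identical shape.

    Each matrix is a list of rows; entry [i][j] is the predicted association
    score between miRNA i and disease j from one sub-network (assumed layout).
    """
    if len(scores_mirna_view) != len(scores_disease_view):
        raise ValueError("score matrices must have the same number of rows")
    combined = []
    for row_a, row_b in zip(scores_mirna_view, scores_disease_view):
        if len(row_a) != len(row_b):
            raise ValueError("score matrices must have the same number of columns")
        combined.append([(a + b) / 2.0 for a, b in zip(row_a, row_b)])
    return combined


# Hypothetical scores for 2 miRNAs x 2 diseases from each sub-network.
final_scores = average_ensemble(
    [[0.25, 1.0], [0.5, 0.0]],
    [[0.75, 0.5], [0.5, 1.0]],
)
```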


A Reinforced Topic-Aware Convolutional Sequence-to-Sequence Model for Abstractive Text Summarization

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/619 ◽

2018 ◽

Cited By ~ 17

Author(s):

Li Wang ◽

Junlin Yao ◽

Yunzhe Tao ◽

Li Zhong ◽

Wei Liu ◽

...

Keyword(s):

Deep Learning ◽

Experimental Evaluation ◽

State Of The Art ◽

Text Summarization ◽

The Other ◽

Learning Approach ◽

Automatic Summarization ◽

Word Level ◽

Proposed Model ◽

Abstractive Summarization

In this paper, we propose a deep learning approach to automatic summarization that incorporates topic information into the convolutional sequence-to-sequence (ConvS2S) model and uses self-critical sequence training (SCST) for optimization. By jointly attending to topics and word-level alignment, our approach can improve the coherence, diversity, and informativeness of generated summaries via a biased probability generation mechanism. In addition, reinforcement training such as SCST directly optimizes the proposed model with respect to the non-differentiable metric ROUGE, which also avoids exposure bias during inference. We compare our approach experimentally with state-of-the-art methods on the Gigaword, DUC-2004, and LCSTS datasets. The empirical results demonstrate the superiority of our proposed method in abstractive summarization.
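The self-critical objective mentioned above can be written as a per-example loss: the reward of a sampled summary is compared against the reward of the greedy-decoded summary, and the difference scales the log-likelihood of the sample. The sketch below shows only that arithmetic; the reward values stand in for ROUGE scores and are hypothetical, and the function name is an assumption.

```python
import math


def scst_loss(sample_log_probs, sample_reward, greedy_reward):
    """Self-critical loss for one sequence.

    sample_log_probs: log-probabilities of each token in the sampled sequence.
    sample_reward:    reward of the sampled sequence (e.g. a ROUGE score).
    greedy_reward:    reward of the greedy-decoded sequence, used as baseline.
    loss = -(sample_reward - greedy_reward) * sum(log p(sampled token))
    """
    advantage = sample_reward - greedy_reward
    return -advantage * sum(sample_log_probs)


# Hypothetical two-token sample with token probabilities 0.5 and 0.25.
log_probs = [math.log(0.5), math.log(0.25)]
loss_better = scst_loss(log_probs, sample_reward=0.6, greedy_reward=0.4)
loss_worse = scst_loss(log_probs, sample_reward=0.3, greedy_reward=0.4)
```

When the sample beats the greedy baseline the loss is positive, so minimizing it raises the sample's probability; when the sample is worse, the sign flips and its probability is pushed down.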


Guided Attention Network for Concept Extraction

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/200 ◽

2021 ◽

Author(s):

Songtao Fang ◽

Zhenya Huang ◽

Ming He ◽

Shiwei Tong ◽

Xiaoqing Huang ◽

...

Keyword(s):

Conditional Random Field ◽

The Other ◽

Context Word ◽

Attention Networks ◽

Large Margin ◽

Concept Extraction ◽

Aggregate Information ◽

Textual Data ◽

Structured Information ◽

Novel Model

Concept extraction aims to find words or phrases describing a concept in massive texts. Recently, researchers have proposed many neural network-based methods to extract concepts automatically. Although these methods show promising results, they ignore structured information in the raw textual data (e.g., title, topic, and clue words). In this paper, we propose a novel model, named Guided Attention Concept Extraction Network (GACEN), which uses title, topic, and clue words as additional supervision to provide guidance directly. Specifically, GACEN comprises two attention networks. One gathers the relevant title and topic information for each context word in the document. The other models the implicit connection between informative words (clue words) and concepts. Finally, we aggregate information from the two networks as input to a Conditional Random Field (CRF) to model dependencies in the output. We collected clue words for three well-studied datasets. Extensive experiments demonstrate that our model outperforms the baseline models by a large margin, especially when the labeled data is insufficient.
