KRAN: Knowledge Refining Attention Network for Recommendation

Zhenyu Zhang; Lei Zhang; Dingqi Yang; Liu Yang

doi:10.1145/3470783

KRAN: Knowledge Refining Attention Network for Recommendation

ACM Transactions on Knowledge Discovery from Data ◽

10.1145/3470783 ◽

2022 ◽

Vol 16 (2) ◽

pp. 1-20

Author(s):

Zhenyu Zhang ◽

Lei Zhang ◽

Dingqi Yang ◽

Liu Yang

Keyword(s):

State Of The Art ◽

Negative Impact ◽

Cold Start ◽

Attention Mechanism ◽

Knowledge Graph ◽

Convolutional Network ◽

Auxiliary Data ◽

Attention Network ◽

Additional Information ◽

Data Source

Recommender algorithms combining knowledge graph and graph convolutional network are becoming more and more popular recently. Specifically, attributes describing the items to be recommended are often used as additional information. These attributes along with items are highly interconnected, intrinsically forming a Knowledge Graph (KG). These algorithms use KGs as an auxiliary data source to alleviate the negative impact of data sparsity. However, these graph convolutional network based algorithms do not distinguish the importance of different neighbors of entities in the KG, and according to Pareto’s principle, the important neighbors only account for a small proportion. These traditional algorithms can not fully mine the useful information in the KG. To fully release the power of KGs for building recommender systems, we propose in this article KRAN, a Knowledge Refining Attention Network, which can subtly capture the characteristics of the KG and thus boost recommendation performance. We first introduce a traditional attention mechanism into the KG processing, making the knowledge extraction more targeted, and then propose a refining mechanism to improve the traditional attention mechanism to extract the knowledge in the KG more effectively. More precisely, KRAN is designed to use our proposed knowledge-refining attention mechanism to aggregate and obtain the representations of the entities (both attributes and items) in the KG. Our knowledge-refining attention mechanism first measures the relevance between an entity and it’s neighbors in the KG by attention coefficients, and then further refines the attention coefficients using a “richer-get-richer” principle, in order to focus on highly relevant neighbors while eliminating less relevant neighbors for noise reduction. In addition, for the item cold start problem, we propose KRAN-CD, a variant of KRAN, which further incorporates pre-trained KG embeddings to handle cold start items. Experiments show that KRAN and KRAN-CD consistently outperform state-of-the-art baselines across different settings.

Download Full-text

Graph Contextualized Self-Attention Network for Session-based Recommendation

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/547 ◽

2019 ◽

Cited By ~ 20

Author(s):

Chengfeng Xu ◽

Pengpeng Zhao ◽

Yanchi Liu ◽

Victor S. Sheng ◽

Jiajie Xu ◽

...

Keyword(s):

Neural Network ◽

State Of The Art ◽

Attention Mechanism ◽

The Self ◽

Graph Structure ◽

Convolutional Network ◽

Attention Network ◽

Attention Model ◽

Sequence Modeling ◽

Real World Datasets

Session-based recommendation, which aims to predict the user's immediate next action based on anonymous sessions, is a key task in many online services (e.g., e-commerce, media streaming). Recently, Self-Attention Network (SAN) has achieved significant success in various sequence modeling tasks without using either recurrent or convolutional network. However, SAN lacks local dependencies that exist over adjacent items and limits its capacity for learning contextualized representations of items in sequences. In this paper, we propose a graph contextualized self-attention model (GC-SAN), which utilizes both graph neural network and self-attention mechanism, for session-based recommendation. In GC-SAN, we dynamically construct a graph structure for session sequences and capture rich local dependencies via graph neural network (GNN). Then each session learns long-range dependencies by applying the self-attention mechanism. Finally, each session is represented as a linear combination of the global preference and the current interest of that session. Extensive experiments on two real-world datasets show that GC-SAN outperforms state-of-the-art methods consistently.

Download Full-text

Relational Graph Neural Network with Hierarchical Attention for Knowledge Graph Completion

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6508 ◽

2020 ◽

Vol 34 (05) ◽

pp. 9612-9619

Author(s):

Zhao Zhang ◽

Fuzhen Zhuang ◽

Hengshu Zhu ◽

Zhiping Shi ◽

Hui Xiong ◽

...

Keyword(s):

Neural Network ◽

Missing Values ◽

State Of The Art ◽

Attention Mechanism ◽

Knowledge Graph ◽

Incomplete Knowledge ◽

Neighborhood Information ◽

Local Neighborhood ◽

Proposed Model ◽

Knowledge Graphs

The rapid proliferation of knowledge graphs (KGs) has changed the paradigm for various AI-related applications. Despite their large sizes, modern KGs are far from complete and comprehensive. This has motivated the research in knowledge graph completion (KGC), which aims to infer missing values in incomplete knowledge triples. However, most existing KGC models treat the triples in KGs independently without leveraging the inherent and valuable information from the local neighborhood surrounding an entity. To this end, we propose a Relational Graph neural network with Hierarchical ATtention (RGHAT) for the KGC task. The proposed model is equipped with a two-level attention mechanism: (i) the first level is the relation-level attention, which is inspired by the intuition that different relations have different weights for indicating an entity; (ii) the second level is the entity-level attention, which enables our model to highlight the importance of different neighboring entities under the same relation. The hierarchical attention mechanism makes our model more effective to utilize the neighborhood information of an entity. Finally, we extensively validate the superiority of RGHAT against various state-of-the-art baselines.

Download Full-text

End-to-End Structure-Aware Convolutional Networks for Knowledge Base Completion

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33013060 ◽

2019 ◽

Vol 33 ◽

pp. 3060-3067 ◽

Cited By ~ 20

Author(s):

Chao Shang ◽

Yun Tang ◽

Jing Huang ◽

Jinbo Bi ◽

Xiaodong He ◽

...

Keyword(s):

Knowledge Base ◽

State Of The Art ◽

The State ◽

Graph Connectivity ◽

Knowledge Graph ◽

Graph Node ◽

Convolutional Network ◽

Node Attributes ◽

Knowledge Graphs ◽

End To End

Knowledge graph embedding has been an active research topic for knowledge base completion, with progressive improvement from the initial TransE, TransH, DistMult et al to the current state-of-the-art ConvE. ConvE uses 2D convolution over embeddings and multiple layers of nonlinear features to model knowledge graphs. The model can be efficiently trained and scalable to large knowledge graphs. However, there is no structure enforcement in the embedding space of ConvE. The recent graph convolutional network (GCN) provides another way of learning graph node embedding by successfully utilizing graph connectivity structure. In this work, we propose a novel end-to-end StructureAware Convolutional Network (SACN) that takes the benefit of GCN and ConvE together. SACN consists of an encoder of a weighted graph convolutional network (WGCN), and a decoder of a convolutional network called Conv-TransE. WGCN utilizes knowledge graph node structure, node attributes and edge relation types. It has learnable weights that adapt the amount of information from neighbors used in local aggregation, leading to more accurate embeddings of graph nodes. Node attributes in the graph are represented as additional nodes in the WGCN. The decoder Conv-TransE enables the state-of-the-art ConvE to be translational between entities and relations while keeps the same link prediction performance as ConvE. We demonstrate the effectiveness of the proposed SACN on standard FB15k-237 and WN18RR datasets, and it gives about 10% relative improvement over the state-of-theart ConvE in terms of HITS@1, HITS@3 and HITS@10.

Download Full-text

SPMC: Socially-Aware Personalized Markov Chains for Sparse Sequential Recommendation

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/204 ◽

2017 ◽

Cited By ~ 13

Author(s):

Chenwei Cai ◽

Ruining He ◽

Julian McAuley

Keyword(s):

Markov Chains ◽

Social Relationships ◽

Real World ◽

Matrix Factorization ◽

State Of The Art ◽

Cold Start ◽

Additional Information ◽

New Methods ◽

Real World Datasets ◽

Sequential Information

Dealing with sparse, long-tailed datasets, and cold-start problems is always a challenge for recommender systems. These issues can partly be dealt with by making predictions not in isolation, but by leveraging information from related events; such information could include signals from social relationships or from the sequence of recent activities. Both types of additional information can be used to improve the performance of state-of-the-art matrix factorization-based techniques. In this paper, we propose new methods to combine both social and sequential information simultaneously, in order to further improve recommendation performance. We show these techniques to be particularly effective when dealing with sparsity and cold-start issues in several large, real-world datasets.

Download Full-text

Attention-Based Graph Convolutional Network for Zero-Shot Learning with Pre-Training

Mathematical Problems in Engineering ◽

10.1155/2021/7480712 ◽

2021 ◽

Vol 2021 ◽

pp. 1-13

Author(s):

Xuefei Wu ◽

Mingjiang Liu ◽

Bo Xin ◽

Zhangqing Zhu ◽

Gang Wang

Keyword(s):

Loss Function ◽

Attention Mechanism ◽

Propagation Model ◽

Knowledge Graph ◽

Learning Paradigm ◽

Convolutional Network ◽

Dense Graph ◽

Convolutional Networks ◽

Central Node ◽

Graph Propagation

Zero-shot learning (ZSL) is a powerful and promising learning paradigm for classifying instances that have not been seen in training. Although graph convolutional networks (GCNs) have recently shown great potential for the ZSL tasks, these models cannot adjust the constant connection weights between the nodes in knowledge graph and the neighbor nodes contribute equally to classify the central node. In this study, we apply an attention mechanism to adjust the connection weights adaptively to learn more important information for classifying unseen target nodes. First, we propose an attention graph convolutional network for zero-shot learning (AGCNZ) by integrating the attention mechanism and GCN directly. Then, in order to prevent the dilution of knowledge from distant nodes, we apply the dense graph propagation (DGP) model for the ZSL tasks and propose an attention dense graph propagation model for zero-shot learning (ADGPZ). Finally, we propose a modified loss function with a relaxation factor to further improve the performance of the learned classifier. Experimental results under different pre-training settings verified the effectiveness of the proposed attention-based models for ZSL.

Download Full-text

End-to-end Relation-Enhanced Learnable Graph Self-attention Network for Knowledge Graphs Embedding

10.21203/rs.3.rs-396932/v1 ◽

2021 ◽

Author(s):

Shengchen Jiang ◽

Hongbin Wang ◽

Xiang Hou

Keyword(s):

Large Scale ◽

Structural Characteristics ◽

Graph Embedding ◽

Knowledge Graph ◽

Data Sets ◽

Relevance Ranking ◽

Convolutional Network ◽

Attention Network ◽

Knowledge Graphs ◽

End To End

Abstract The existing methods ignore the adverse effect of knowledge graph incompleteness on knowledge graph embedding. In addition, the complexity and large-scale of knowledge information hinder knowledge graph embedding performance of the classic graph convolutional network. In this paper, we analyzed the structural characteristics of knowledge graph and the imbalance of knowledge information. Complex knowledge information requires that the model should have better learnability, rather than linearly weighted qualitative constraints, so the method of end-to-end relation-enhanced learnable graph self-attention network for knowledge graphs embedding is proposed. Firstly, we construct the relation-enhanced adjacency matrix to consider the incompleteness of the knowledge graph. Secondly, the graph self-attention network is employed to obtain the global encoding and relevance ranking of entity node information. Thirdly, we propose the concept of convolutional knowledge subgraph, it is constructed according to the entity relevance ranking. Finally, we improve the training effect of the convKB model by changing the construction of negative samples to obtain a better reliability score in the decoder. The experimental results based on the data sets FB15k-237 and WN18RR show that the proposed method facilitates more comprehensive representation of knowledge information than the existing methods, in terms of Hits@10 and MRR.

Download Full-text

An Attention-Enhanced Multi-Scale and Dual Sign Language Recognition Network Based on a Graph Convolution Network

Sensors ◽

10.3390/s21041120 ◽

2021 ◽

Vol 21 (4) ◽

pp. 1120

Author(s):

Lu Meng ◽

Ronghui Li

Keyword(s):

Sign Language ◽

Attention Mechanism ◽

Language Recognition ◽

Sign Language Recognition ◽

Accuracy Rate ◽

Long Distance ◽

Convolutional Network ◽

Attention Network ◽

Multi Scale ◽

Human Finger

Sign language is the most important way of communication for hearing-impaired people. Research on sign language recognition can help normal people understand sign language. We reviewed the classic methods of sign language recognition, and the recognition accuracy is not high enough because of redundant information, human finger occlusion, motion blurring, the diversified signing styles of different people, and so on. To overcome these shortcomings, we propose a multi-scale and dual sign language recognition Network (SLR-Net) based on a graph convolutional network (GCN). The original input data was RGB videos. We first extracted the skeleton data from them and then used the skeleton data for sign language recognition. SLR-Net is mainly composed of three sub-modules: multi-scale attention network (MSA), multi-scale spatiotemporal attention network (MSSTA) and attention enhanced temporal convolution network (ATCN). MSA allows the GCN to learn the dependencies between long-distance vertices; MSSTA can directly learn the spatiotemporal features; ATCN allows the GCN network to better learn the long temporal dependencies. The three different attention mechanisms, multi-scale attention mechanism, spatiotemporal attention mechanism, and temporal attention mechanism, are proposed to further improve the robustness and accuracy. Besides, a keyframe extraction algorithm is proposed, which can greatly improve efficiency by sacrificing a little accuracy. Experimental results showed that our method can reach 98.08% accuracy rate in the CSL-500 dataset with a 500-word vocabulary. Even on the challenging dataset DEVISIGN-L with a 2000-word vocabulary, it also reached a 64.57% accuracy rate, outperforming other state-of-the-art sign language recognition methods.

Download Full-text

KGGCN: Knowledge-Guided Graph Convolutional Networks for Distantly Supervised Relation Extraction

Applied Sciences ◽

10.3390/app11167734 ◽

2021 ◽

Vol 11 (16) ◽

pp. 7734

Author(s):

Ningyi Mao ◽

Wenti Huang ◽

Hai Zhong

Keyword(s):

Prior Knowledge ◽

Structural Information ◽

Relation Extraction ◽

Attention Mechanism ◽

Knowledge Graph ◽

Convolutional Network ◽

Convolutional Networks ◽

Lexical Resource ◽

Sentence Level ◽

The Impact

Distantly supervised relation extraction is the most popular technique for identifying semantic relation between two entities. Most prior models only focus on the supervision information present in training sentences. In addition to training sentences, external lexical resource and knowledge graphs often contain other relevant prior knowledge. However, relation extraction models usually ignore such readily available information. Moreover, previous works only utilize a selective attention mechanism over sentences to alleviate the impact of noise, they lack the consideration of the implicit interaction between sentences with relation facts. In this paper, (1) a knowledge-guided graph convolutional network is proposed based on the word-level attention mechanism to encode the sentences. It can capture the key words and cue phrases to generate expressive sentence-level features by attending to the relation indicators obtained from the external lexical resource. (2) A knowledge-guided sentence selector is proposed, which explores the semantic and structural information of triples from knowledge graph as sentence-level knowledge attention to distinguish the importance of each individual sentence. Experimental results on two widely used datasets, NYT-FB and GDS, show that our approach is able to efficiently use the prior knowledge from the external lexical resource and knowledge graph to enhance the performance of distantly supervised relation extraction.

Download Full-text

Road crack segmentation using an attention residual U-Net with generative adversarial

Mathematical Biosciences and Engineering ◽

10.3934/mbe.2021473 ◽

2021 ◽

Vol 18 (6) ◽

pp. 9669-9684

Author(s):

Xing Hu ◽

◽

Minghui Yao ◽

Dawei Zhang

Keyword(s):

Visual Attention ◽

Loss Function ◽

State Of The Art ◽

The State ◽

Attention Mechanism ◽

Semantic Features ◽

Convolutional Network ◽

Adversarial Learning ◽

Art Methods ◽

Visual Attention Mechanism

<abstract> <p>This paper proposed an end-to-end road crack segmentation model based on attention mechanism and deep FCN with generative adversarial learning. We create a segmentation network by introducing a visual attention mechanism and residual module to a fully convolutional network(FCN) to capture richer local features and more global semantic features and get a better segment result. Besides, we use an adversarial network consisting of convolutional layers as a discrimination network. The main contributions of this work are as follows: 1) We introduce a CNN model as a discriminate network to realize adversarial learning to guide the training of the segmentation network, which is trained in a min-max way: the discrimination network is trained by maximizing the loss function, while the segmentation network is trained with the only gradient passed by the discrimination network and aim at minimizing the loss function, and finally an optimal segmentation network is obtained; 2) We add the residual modular and the visual attention mechanism to U-Net, which makes the segmentation results more robust, refined and smooth; 3) Extensive experiments are conducted on three public road crack datasets to evaluate the performance of our proposed model. Qualitative and quantitative comparisons between the proposed method and the state-of-the-art methods show that the proposed method outperforms or is comparable to the state-of-the-art methods in both F1 score and precision. In particular, compared with U-Net, the mIoU of our proposed method is increased about 3%~17% compared with the three public datasets.</p> </abstract>

Download Full-text

Relation-Aware Entity Alignment for Heterogeneous Knowledge Graphs

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/733 ◽

2019 ◽

Cited By ~ 5

Author(s):

Yuting Wu ◽

Xiao Liu ◽

Yansong Feng ◽

Zheng Wang ◽

Rui Yan ◽

...

Keyword(s):

Real World ◽

State Of The Art ◽

Dual Graph ◽

Knowledge Graph ◽

Convolutional Network ◽

Complex Relation ◽

Dual Relation ◽

Knowledge Graphs ◽

Cross Lingual ◽

Heterogeneous Knowledge

Entity alignment is the task of linking entities with the same real-world identity from different knowledge graphs (KGs), which has been recently dominated by embedding-based methods. Such approaches work by learning KG representations so that entity alignment can be performed by measuring the similarities between entity embeddings. While promising, prior works in the field often fail to properly capture complex relation information that commonly exists in multi-relational KGs, leaving much room for improvement. In this paper, we propose a novel Relation-aware Dual-Graph Convolutional Network (RDGCN) to incorporate relation information via attentive interactions between the knowledge graph and its dual relation counterpart, and further capture neighboring structures to learn better entity representations. Experiments on three real-world cross-lingual datasets show that our approach delivers better and more robust results over the state-of-the-art alignment methods by learning better KG representations.

Download Full-text