KGGCN: Knowledge-Guided Graph Convolutional Networks for Distantly Supervised Relation Extraction

2021 · Vol 11 (16) · pp. 7734
Author(s): Ningyi Mao, Wenti Huang, Hai Zhong

Distantly supervised relation extraction is the most popular technique for identifying semantic relations between two entities. Most prior models focus only on the supervision signal present in training sentences. Beyond training sentences, external lexical resources and knowledge graphs often contain relevant prior knowledge, yet relation extraction models usually ignore such readily available information. Moreover, previous works only apply a selective attention mechanism over sentences to alleviate the impact of noise; they do not consider the implicit interaction between sentences and relation facts. In this paper, (1) a knowledge-guided graph convolutional network based on a word-level attention mechanism is proposed to encode the sentences. It captures key words and cue phrases to generate expressive sentence-level features by attending to relation indicators obtained from the external lexical resource. (2) A knowledge-guided sentence selector is proposed, which exploits the semantic and structural information of triples from the knowledge graph as sentence-level knowledge attention to distinguish the importance of each individual sentence. Experimental results on two widely used datasets, NYT-FB and GDS, show that our approach efficiently uses the prior knowledge from the external lexical resource and knowledge graph to enhance the performance of distantly supervised relation extraction.
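
A minimal sketch (not the authors' released code) of the two knowledge-guided attention steps the abstract describes: word-level attention scored against relation-indicator embeddings from the external lexicon, followed by sentence-level attention queried by a knowledge-graph triple embedding. All module names, shapes, and the scoring form are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class KnowledgeGuidedAttention(nn.Module):
    """Word-level then sentence-level attention, both guided by external knowledge."""
    def __init__(self, hidden_dim: int, kg_dim: int):
        super().__init__()
        self.word_proj = nn.Linear(hidden_dim, kg_dim, bias=False)  # align words with indicators
        self.sent_proj = nn.Linear(hidden_dim, kg_dim, bias=False)  # align sentences with triples

    def forward(self, word_states, indicator_emb, triple_emb):
        # word_states:   (num_sent, seq_len, hidden_dim) encoded tokens of one bag
        # indicator_emb: (kg_dim,) relation-indicator embedding from the lexical resource
        # triple_emb:    (kg_dim,) embedding of the (head, relation, tail) KG triple
        word_scores = self.word_proj(word_states) @ indicator_emb      # (num_sent, seq_len)
        word_alpha = F.softmax(word_scores, dim=-1).unsqueeze(-1)
        sent_repr = (word_alpha * word_states).sum(dim=1)              # (num_sent, hidden_dim)
        sent_scores = self.sent_proj(sent_repr) @ triple_emb           # (num_sent,)
        sent_alpha = F.softmax(sent_scores, dim=0).unsqueeze(-1)       # knowledge attention
        return (sent_alpha * sent_repr).sum(dim=0)                     # bag-level representation
```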

2021 · Vol 2021 · pp. 1-13
Author(s): Xuefei Wu, Mingjiang Liu, Bo Xin, Zhangqing Zhu, Gang Wang

Zero-shot learning (ZSL) is a powerful and promising learning paradigm for classifying instances that have not been seen in training. Although graph convolutional networks (GCNs) have recently shown great potential for ZSL tasks, these models use constant connection weights between the nodes in the knowledge graph, so all neighbor nodes contribute equally when classifying the central node. In this study, we apply an attention mechanism to adjust the connection weights adaptively and learn more important information for classifying unseen target nodes. First, we propose an attention graph convolutional network for zero-shot learning (AGCNZ) by integrating the attention mechanism and the GCN directly. Then, in order to prevent the dilution of knowledge from distant nodes, we apply the dense graph propagation (DGP) model to ZSL tasks and propose an attention dense graph propagation model for zero-shot learning (ADGPZ). Finally, we propose a modified loss function with a relaxation factor to further improve the performance of the learned classifier. Experimental results under different pre-training settings verify the effectiveness of the proposed attention-based models for ZSL.
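
The "adaptive connection weights" idea can be realized with a GAT-style layer: attention scores are computed only over existing knowledge-graph edges and replace the constant weights of a plain GCN. A minimal sketch, assuming a dense adjacency matrix that already contains self-loops:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AttentionGCNLayer(nn.Module):
    def __init__(self, in_dim: int, out_dim: int):
        super().__init__()
        self.W = nn.Linear(in_dim, out_dim, bias=False)
        self.attn = nn.Linear(2 * out_dim, 1, bias=False)  # scores a (source, target) pair

    def forward(self, x, adj):
        # x: (N, in_dim) node features; adj: (N, N) binary adjacency with self-loops
        h = self.W(x)                                       # (N, out_dim)
        n = h.size(0)
        pairs = torch.cat([h.unsqueeze(1).expand(n, n, -1),
                           h.unsqueeze(0).expand(n, n, -1)], dim=-1)
        e = torch.tanh(self.attn(pairs)).squeeze(-1)        # (N, N) raw scores
        e = e.masked_fill(adj == 0, float('-inf'))          # restrict to real edges
        alpha = F.softmax(e, dim=-1)                        # learned, non-constant weights
        return F.relu(alpha @ h)
```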


2021 · Vol 11 (15) · pp. 6975
Author(s): Tao Zhang, Lun He, Xudong Li, Guoqing Feng

Lipreading aims to recognize sentences being spoken by a talking face. In recent years, lipreading methods have achieved high accuracy on large datasets and made breakthrough progress. However, lipreading is still far from solved: existing methods tend to have high error rates on in-the-wild data and suffer from vanishing training gradients and slow convergence. To overcome these problems, we propose an efficient end-to-end sentence-level lipreading model whose encoder combines a 3D convolutional network, ResNet50, and a Temporal Convolutional Network (TCN), with a CTC objective function as the decoder. More importantly, the proposed architecture incorporates the TCN as a feature learner to decode features. This partly eliminates the vanishing-gradient and performance limitations of RNNs (LSTM, GRU), yielding notable accuracy gains as well as faster convergence. Experiments show that training and convergence are 50% faster than the state-of-the-art method, and accuracy is improved by 2.4% on the GRID dataset.
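
A rough sketch, not the paper's implementation, of the pipeline the abstract names: a 3D-convolution front end, per-frame ResNet50 features, a dilated temporal convolution stack, and per-frame log-probabilities for a CTC loss. Channel sizes, kernel sizes, and the vocabulary size are assumptions.

```python
import torch
import torch.nn as nn
import torchvision

class LipreadingNet(nn.Module):
    def __init__(self, vocab_size: int = 28):
        super().__init__()
        self.front3d = nn.Sequential(                       # spatiotemporal front end
            nn.Conv3d(1, 64, (5, 7, 7), stride=(1, 2, 2), padding=(2, 3, 3)),
            nn.BatchNorm3d(64), nn.ReLU(),
            nn.MaxPool3d((1, 3, 3), stride=(1, 2, 2), padding=(0, 1, 1)),
        )
        resnet = torchvision.models.resnet50(weights=None)
        resnet.conv1 = nn.Conv2d(64, 64, 7, stride=2, padding=3, bias=False)
        resnet.fc = nn.Identity()                           # keep 2048-d pooled features
        self.resnet = resnet
        self.tcn = nn.Sequential(                           # simple dilated TCN stand-in
            nn.Conv1d(2048, 512, 3, padding=1, dilation=1), nn.ReLU(),
            nn.Conv1d(512, 512, 3, padding=2, dilation=2), nn.ReLU(),
        )
        self.classifier = nn.Linear(512, vocab_size + 1)    # +1 for the CTC blank

    def forward(self, video):
        # video: (B, 1, T, H, W) grayscale mouth crops
        x = self.front3d(video)                             # (B, 64, T, H', W')
        b, c, t, h, w = x.shape
        x = x.permute(0, 2, 1, 3, 4).reshape(b * t, c, h, w)
        x = self.resnet(x).reshape(b, t, -1)                # (B, T, 2048)
        x = self.tcn(x.transpose(1, 2)).transpose(1, 2)     # (B, T, 512)
        # transpose to (T, B, C) before feeding nn.CTCLoss
        return self.classifier(x).log_softmax(-1)
```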


2021 · Vol 2083 (4) · pp. 042044
Author(s): Zuhua Dai, Yuanyuan Liu, Shilong Di, Qi Fan

Abstract: Aspect-level sentiment analysis belongs to fine-grained sentiment analysis, which has attracted extensive research in academic circles in recent years. For this task, the recurrent neural network (RNN) model is usually used for feature extraction, but such models cannot effectively capture the structural information of the text. Recent studies have begun to use the graph convolutional network (GCN) to model the syntactic dependency tree of the text to solve this problem. For short texts, however, the text alone is often insufficient to determine the sentiment polarity of the aspect words, and the knowledge graph is not effectively used as external knowledge that could enrich the semantic information. To solve these problems, this paper proposes a graph convolutional network (GCN) model that can jointly process syntactic information, knowledge graphs, and textual semantic information. The model operates on a "syntax-knowledge" graph to extract syntactic information and common-sense information at the same time. Compared with the latest models, the proposed model effectively improves the accuracy of aspect-level sentiment classification on two datasets.
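
One way to read the "syntax-knowledge" graph, sketched below under assumed shapes: dependency edges and knowledge edges are merged into a single adjacency matrix before a standard normalized graph convolution, so one GCN stack mixes syntactic and common-sense signals. This is an illustration, not the paper's exact formulation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SyntaxKnowledgeGCN(nn.Module):
    def __init__(self, dim: int, num_layers: int = 2):
        super().__init__()
        self.layers = nn.ModuleList(nn.Linear(dim, dim) for _ in range(num_layers))

    def forward(self, x, syn_adj, kg_adj):
        # x: (N, dim) token embeddings; syn_adj / kg_adj: (N, N) edge indicators
        adj = ((syn_adj + kg_adj) > 0).float()              # merged syntax-knowledge graph
        adj = adj + torch.eye(adj.size(0))                  # self-loops
        adj = adj / adj.sum(-1, keepdim=True)               # row-normalize: D^-1 (A + I)
        for layer in self.layers:
            x = F.relu(layer(adj @ x))                      # one graph convolution step
        return x
```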


2021 · Vol 2021 · pp. 1-13
Author(s): Mingjing Tang, Tong Li, Wei Wang, Rui Zhu, Zifei Ma, ...

Software knowledge communities contain a large number of software knowledge entities with complex structure and rich semantic relations. Semantic relation extraction over these entities is a critical task for software knowledge graph construction, with an important impact on knowledge-graph-based tasks such as software document generation and software expert recommendation. Due to entity sparsity, relation ambiguity, and the lack of annotated datasets in the user-generated content of software knowledge communities, existing relation extraction methods are difficult to apply in the software knowledge domain. To address these issues, we propose a novel software knowledge entity relation extraction model which incorporates entity-aware information with syntactic dependency information. A Bidirectional Gated Recurrent Unit (Bi-GRU) and a Graph Convolutional Network (GCN) are used to learn contextual semantic representations and syntactic dependency representations, respectively. To capture more syntactic dependency information, a weighted graph convolutional network based on Newton's law of cooling is constructed by computing a weighted adjacency matrix. In addition, an entity-aware attention mechanism is proposed to integrate the entity information and syntactic dependency information to improve the prediction performance of the model. Experiments conducted on a dataset constructed from StackOverflow texts show that the proposed model outperforms the benchmark models.
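
The Newton's-cooling-law weighting can be pictured as edge weights that decay exponentially with distance in the dependency tree, w_ij = exp(-k * d_ij). A small sketch; the decay constant k and the distance convention are assumptions:

```python
import numpy as np

def cooling_weight_adjacency(dist: np.ndarray, k: float = 0.5) -> np.ndarray:
    # dist[i, j]: hop distance between tokens i and j in the dependency tree,
    # with np.inf for unreachable pairs; returns a dense weighted adjacency matrix
    w = np.exp(-k * dist)             # exponential decay, as in Newton's law of cooling
    w[np.isinf(dist)] = 0.0           # unreachable pairs carry no weight
    np.fill_diagonal(w, 1.0)          # self-loops at full weight
    return w
```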


Author(s): Teng Jiang, Liang Gong, Yupu Yang

The attention-based encoder-decoder framework has greatly improved image caption generation tasks. The attention mechanism plays a transitional role by transforming static image features into sequential captions. To generate reasonable captions, it is of great significance to detect the spatial characteristics of images. In this paper, we propose a spatial relational attention approach that considers spatial positions and attributes. Image features are first weighted by the attention mechanism, then concatenated with contextual features to form a spatial-visual tensor. Features are extracted from this tensor by a fully convolutional network to produce visual concepts for the decoder network; the fully convolutional layers maintain the spatial topology of images. Experiments conducted on three benchmark datasets, namely Flickr8k, Flickr30k, and MSCOCO, demonstrate the effectiveness of our proposed approach. Captions generated by the spatial relational attention method precisely capture the spatial relations of objects.
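
A schematic sketch, with assumed shapes, of the attention step the abstract outlines: attention-weighted image features are concatenated with broadcast contextual features into a spatial-visual tensor, which fully convolutional layers then turn into visual concepts without flattening away the spatial layout.

```python
import torch
import torch.nn as nn

class SpatialRelationalAttention(nn.Module):
    def __init__(self, feat_dim: int, ctx_dim: int, out_dim: int):
        super().__init__()
        self.score = nn.Conv2d(feat_dim + ctx_dim, 1, kernel_size=1)
        self.fcn = nn.Sequential(                    # fully convolutional, keeps topology
            nn.Conv2d(feat_dim + ctx_dim, out_dim, 3, padding=1), nn.ReLU(),
            nn.Conv2d(out_dim, out_dim, 3, padding=1), nn.ReLU(),
        )

    def forward(self, feats, ctx):
        # feats: (B, feat_dim, H, W) CNN feature map; ctx: (B, ctx_dim) decoder state
        ctx_map = ctx[:, :, None, None].expand(-1, -1, feats.size(2), feats.size(3))
        alpha = torch.sigmoid(self.score(torch.cat([feats, ctx_map], 1)))  # (B, 1, H, W)
        tensor = torch.cat([feats * alpha, ctx_map], 1)  # spatial-visual tensor
        return self.fcn(tensor)                          # visual concepts for the decoder
```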


2020 · Vol 34 (05) · pp. 8928-8935
Author(s): Kai Sun, Richong Zhang, Yongyi Mao, Samuel Mensah, Xudong Liu

Many approaches have been proposed to leverage the dependency tree in the relation classification task. Recent works have focused on pruning irrelevant information from the dependency tree. The state-of-the-art Attention Guided Graph Convolutional Network (AGGCN) transforms the dependency tree into a weighted graph to distinguish the relevance of nodes and edges for relation classification. However, in that approach the graph is fully connected, which destroys the structural information of the original dependency tree. How to effectively use relevant information while ignoring irrelevant information from the dependency tree remains a challenge in the relation classification task. In this work, we learn to transform the dependency tree into a weighted graph by considering the syntactic dependencies of the connected nodes while preserving the structure of the original dependency tree. We refer to this graph as a syntax-transport graph. We further propose a learnable syntax-transport attention graph convolutional network (LST-AGCN) which operates on the syntax-transport graph directly to distill the final representation, which is sufficient for classification. Experiments on SemEval-2010 Task 8 and TACRED show our approach outperforms previous methods.
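
A minimal sketch of the syntax-transport idea as the abstract states it: attention weights are learned, but only over edges that exist in the original dependency tree, so the tree structure is preserved rather than replaced by a fully connected graph. The scaled dot-product scoring is an assumption for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SyntaxTransportGCNLayer(nn.Module):
    def __init__(self, dim: int):
        super().__init__()
        self.W = nn.Linear(dim, dim)
        self.q = nn.Linear(dim, dim, bias=False)
        self.k = nn.Linear(dim, dim, bias=False)

    def forward(self, x, tree_adj):
        # x: (N, dim) token states; tree_adj: (N, N) dependency edges incl. self-loops
        scores = self.q(x) @ self.k(x).T / x.size(-1) ** 0.5       # (N, N)
        scores = scores.masked_fill(tree_adj == 0, float('-inf'))  # tree edges only
        alpha = F.softmax(scores, dim=-1)                          # syntax-transport weights
        return F.relu(self.W(alpha @ x))
```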


Author(s): Junyu Gao, Tianzhu Zhang, Changsheng Xu

Recently, with the ever-growing number of action categories, zero-shot action recognition (ZSAR) has been achieved by automatically mining the underlying concepts (e.g., actions, attributes) in videos. However, most existing methods only exploit the visual cues of these concepts and ignore external knowledge that could model explicit relationships between them. In fact, humans have a remarkable ability to transfer knowledge learned from familiar classes to recognize unfamiliar ones. To narrow this knowledge gap between existing methods and humans, we propose an end-to-end ZSAR framework based on a structured knowledge graph, which can jointly model action-attribute, action-action, and attribute-attribute relationships. To effectively leverage the knowledge graph, we design a novel Two-Stream Graph Convolutional Network (TS-GCN) consisting of a classifier branch and an instance branch. Specifically, the classifier branch takes the semantic-embedding vectors of all the concepts as input and generates the classifiers for action categories. The instance branch maps the attribute embeddings and scores of each video instance into an attribute-feature space. Finally, the generated classifiers are evaluated on the attribute features of each video, and a classification loss is adopted to optimize the whole network. In addition, a self-attention module is utilized to model the temporal information of videos. Extensive experimental results on three realistic action benchmarks, Olympic Sports, HMDB51, and UCF101, demonstrate the favorable performance of our proposed framework.
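
A rough two-branch sketch of the TS-GCN design from the abstract: the classifier branch runs a graph convolution over concept embeddings to generate per-category classifiers, while the instance branch maps a video's attribute scores into the same space; their dot product yields the classification logits. The single-layer GCN and all dimensions are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TwoStreamGCN(nn.Module):
    def __init__(self, emb_dim: int, feat_dim: int, num_attrs: int):
        super().__init__()
        self.gcn = nn.Linear(emb_dim, feat_dim)          # one-layer GCN for brevity
        self.instance = nn.Linear(num_attrs, feat_dim)   # attribute scores -> feature space

    def forward(self, concept_emb, adj, attr_scores):
        # concept_emb: (num_concepts, emb_dim) embeddings of actions and attributes
        # adj: (num_concepts, num_concepts) normalized knowledge-graph adjacency
        # attr_scores: (batch, num_attrs) per-video attribute detection scores
        classifiers = F.relu(self.gcn(adj @ concept_emb))  # classifier branch
        video_feat = self.instance(attr_scores)            # instance branch
        return video_feat @ classifiers.T                  # logits over all concepts
```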


2019 · Vol 7 · pp. 297-312
Author(s): Zhijiang Guo, Yan Zhang, Zhiyang Teng, Wei Lu

We focus on graph-to-sequence learning, which can be framed as transducing graph structures to sequences for text generation. To capture structural information associated with graphs, we investigate the problem of encoding graphs using graph convolutional networks (GCNs). Unlike various existing approaches where shallow architectures were used for capturing local structural information only, we introduce a dense connection strategy, proposing a novel Densely Connected Graph Convolutional Network (DCGCN). Such a deep architecture is able to integrate both local and non-local features to learn a better structural representation of a graph. Our model outperforms the state-of-the-art neural models significantly on AMR-to-text generation and syntax-based neural machine translation.
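
The dense connection strategy mirrors DenseNet: each GCN layer receives the concatenation of the original input and every earlier layer's output, which is what lets a deep stack combine local and non-local features. A compact sketch with assumed layer sizes:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DenselyConnectedGCN(nn.Module):
    def __init__(self, in_dim: int, growth: int, num_layers: int):
        super().__init__()
        self.layers = nn.ModuleList(
            nn.Linear(in_dim + i * growth, growth) for i in range(num_layers)
        )

    def forward(self, x, adj):
        # x: (N, in_dim) node features; adj: (N, N) normalized adjacency
        outputs = [x]
        for layer in self.layers:
            h = torch.cat(outputs, dim=-1)          # dense links to all earlier layers
            outputs.append(F.relu(layer(adj @ h)))  # graph convolution on the stack
        return torch.cat(outputs, dim=-1)           # multi-scale final representation
```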


2021 · Vol 2021 · pp. 1-10
Author(s): Qingtian Zeng, Chao Wang, Geng Chen, Hua Duan

Industrial parks are one of the main sources of air pollution. The ability to forecast PM2.5, the main pollutant in an industrial park, is of great significance to the health of the workers and to environmental governance, as it can improve environmental management decision-making. Most existing PM2.5 concentration forecasting methods cannot model the dynamic temporal and spatial correlations of PM2.5 concentration. To improve forecast accuracy in an industrial park environment, this paper proposes a deep-learning-based spatiotemporal graph convolutional network with an attention mechanism (STAM-STGCN) to solve the PM2.5 concentration forecasting problem. When constructing the adjacency matrix, we use not only the Euclidean distance between sites but also the influence of wind fields and of pollution sources near the nodes. In the model, we first use the spatiotemporal attention mechanism to capture the dynamic spatiotemporal correlations in the PM2.5 data. In the spatiotemporal convolution module, graph convolutional networks capture spatial features and standard convolutions describe temporal features. Finally, the output module adjusts the shape of the data to produce the final forecast. The mean absolute error (MAE), root mean squared error (RMSE), and mean absolute percentage error (MAPE) are used as performance metrics, and the Dongmingnan Industrial Park atmospheric dataset is used to verify the effectiveness of the proposed algorithm. The experimental results show that the STAM-STGCN model captures the spatial-temporal characteristics of PM2.5 concentration data more fully than the baselines; compared with the strongest baseline, RMSE improves by about 24.2%, MAE by about 35.8%, and MAPE by about 34.6%.
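
A simplified sketch of the adjacency construction the abstract describes: a Gaussian kernel over inter-site Euclidean distance gives the base weight, which is then increased where the wind field points from one site toward another or a pollution source lies near the edge. The additive combination and all coefficients are assumptions for illustration.

```python
import numpy as np

def build_adjacency(dist, wind_align, source_near,
                    sigma: float = 2.0, wind_w: float = 0.3, src_w: float = 0.2):
    # dist:        (N, N) Euclidean distances between monitoring sites (km)
    # wind_align:  (N, N) in [0, 1], how well the wind points from site j to site i
    # source_near: (N, N) in {0, 1}, pollution source near the j -> i edge
    adj = np.exp(-(dist ** 2) / (2 * sigma ** 2))   # distance-based Gaussian kernel
    adj = adj + wind_w * wind_align + src_w * source_near
    np.fill_diagonal(adj, 1.0)                      # self-loops
    return adj / adj.sum(axis=1, keepdims=True)     # row-normalized weights
```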

