Attention-Enhanced Graph Convolutional Networks for Aspect-Based Sentiment Classification with Multi-Head Attention

The purpose of aspect-based sentiment classification is to identify the sentiment polarity of each aspect in a sentence. Recently, due to the introduction of Graph Convolutional Networks (GCN), more and more studies have used sentence structure information to establish the connection between aspects and opinion words. However, the accuracy of these methods is limited by noise information and dependency tree parsing performance. To solve this problem, we proposed an attention-enhanced graph convolutional network (AEGCN) for aspect-based sentiment classification with multi-head attention (MHA). Our proposed method can better combine semantic and syntactic information by introducing MHA and GCN. We also added an attention mechanism to GCN to enhance its performance. In order to verify the effectiveness of our proposed method, we conducted a lot of experiments on five benchmark datasets. The experimental results show that our proposed method can make more reasonable use of semantic and syntactic information, and further improve the performance of GCN.

Download Full-text

Targeted Sentiment Classification Based on Attentional Encoding and Graph Convolutional Networks

Applied Sciences ◽

10.3390/app10030957 ◽

2020 ◽

Vol 10 (3) ◽

pp. 957 ◽

Cited By ~ 3

Author(s):

Luwei Xiao ◽

Xiaohui Hu ◽

Yinong Chen ◽

Yun Xue ◽

Donghong Gu ◽

...

Keyword(s):

Neural Networks ◽

Semantic Information ◽

Sentiment Classification ◽

Convolutional Network ◽

Convolutional Networks ◽

Dependency Tree ◽

Proposed Model ◽

Syntactic Information ◽

Specific Goal ◽

State Of Art

Targeted sentiment classification aims to predict the emotional trend of a specific goal. Currently, most methods (e.g., recurrent neural networks and convolutional neural networks combined with an attention mechanism) are not able to fully capture the semantic information of the context and they also lack a mechanism to explain the relevant syntactical constraints and long-range word dependencies. Therefore, syntactically irrelevant context words may mistakenly be recognized as clues to predict the target sentiment. To tackle these problems, this paper considers that the semantic information, syntactic information, and their interaction information are very crucial to targeted sentiment analysis, and propose an attentional-encoding-based graph convolutional network (AEGCN) model. Our proposed model is mainly composed of multi-head attention and an improved graph convolutional network built over the dependency tree of a sentence. Pre-trained BERT is applied to this task, and new state-of-art performance is achieved. Experiments on five datasets show the effectiveness of the model proposed in this paper compared with a series of the latest models.

Download Full-text

Sentence Compression Using BERT and Graph Convolutional Networks

Applied Sciences ◽

10.3390/app11219910 ◽

2021 ◽

Vol 11 (21) ◽

pp. 9910

Author(s):

Yo-Han Park ◽

Gyong-Ho Lee ◽

Yong-Seok Choi ◽

Kong-Joo Lee

Keyword(s):

Language Processing ◽

Large Scale ◽

Convolutional Network ◽

Convolutional Networks ◽

Compression Model ◽

Sentence Compression ◽

Dependency Tree ◽

Proposed Model ◽

Syntactic Information ◽

Input Sentence

Sentence compression is a natural language-processing task that produces a short paraphrase of an input sentence by deleting words from the input sentence while ensuring grammatical correctness and preserving meaningful core information. This study introduces a graph convolutional network (GCN) into a sentence compression task to encode syntactic information, such as dependency trees. As we upgrade the GCN to activate a directed edge, the compression model with the GCN layers can distinguish between parent and child nodes in a dependency tree when aggregating adjacent nodes. Furthermore, by increasing the number of GCN layers, the model can gradually collect high-order information of a dependency tree when propagating node information through the layers. We implement a sentence compression model for Korean and English, respectively. This model consists of three components: pre-trained BERT model, GCN layers, and a scoring layer. The scoring layer can determine whether a word should remain in a compressed sentence by relying on the word vector containing contextual and syntactic information encoded by BERT and GCN layers. To train and evaluate the proposed model, we used the Google sentence compression dataset for English and a Korean sentence compression corpus containing about 140,000 sentence pairs for Korean. The experimental results demonstrate that the proposed model achieves state-of-the-art performance for English. To the best of our knowledge, this sentence compression model based on the deep learning model trained with a large-scale corpus is the first attempt for Korean.

Download Full-text

Relation Extraction with Convolutional Network over Learnable Syntax-Transport Graph

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6423 ◽

2020 ◽

Vol 34 (05) ◽

pp. 8928-8935

Author(s):

Kai Sun ◽

Richong Zhang ◽

Yongyi Mao ◽

Samuel Mensah ◽

Xudong Liu

Keyword(s):

Relation Extraction ◽

Weighted Graph ◽

Relevant Information ◽

Irrelevant Information ◽

Classification Task ◽

Convolutional Network ◽

Structure Information ◽

Convolutional Networks ◽

Dependency Tree ◽

Relation Classification

A large majority of approaches have been proposed to leverage the dependency tree in the relation classification task. Recent works have focused on pruning irrelevant information from the dependency tree. The state-of-the-art Attention Guided Graph Convolutional Networks (AGGCNs) transforms the dependency tree into a weighted-graph to distinguish the relevance of nodes and edges for relation classification. However, in their approach, the graph is fully connected, which destroys the structure information of the original dependency tree. How to effectively make use of relevant information while ignoring irrelevant information from the dependency trees remains a challenge in the relation classification task. In this work, we learn to transform the dependency tree into a weighted graph by considering the syntax dependencies of the connected nodes and persisting the structure of the original dependency tree. We refer to this graph as a syntax-transport graph. We further propose a learnable syntax-transport attention graph convolutional network (LST-AGCN) which operates on the syntax-transport graph directly to distill the final representation which is sufficient for classification. Experiments on Semeval-2010 Task 8 and Tacred show our approach outperforms previous methods.

Download Full-text

Lexical attention and aspect-oriented graph convolutional networks for aspect-based sentiment analysis

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-211045 ◽

2021 ◽

pp. 1-12

Author(s):

Wenwen Li ◽

Shiqun Yin ◽

Ting Pu

Keyword(s):

Sentiment Analysis ◽

Oriented Graph ◽

Attention Mechanism ◽

Experimental Results ◽

The Other ◽

Convolutional Networks ◽

Other Hand ◽

Syntactic Information ◽

Benchmark Datasets ◽

Syntactic Dependencies

The purpose of aspect-based sentiment analysis is to predict the sentiment polarity of different aspects in a text. In previous work, while attention has been paid to the use of Graph Convolutional Networks (GCN) to encode syntactic dependencies in order to exploit syntactic information, previous models have tended to confuse opinion words from different aspects due to the complexity of language and the diversity of aspects. On the other hand, the effect of word lexicality on aspects’ sentiment polarity judgments has not been considered in previous studies. In this paper, we propose lexical attention and aspect-oriented GCN to solve the above problems. First, we construct an aspect-oriented dependency-parsed tree by analyzing and pruning the dependency-parsed tree of the sentence, then use the lexical attention mechanism to focus on the features of the lexical properties that play a key role in determining the sentiment polarity, and finally extract the aspect-oriented lexical weighted features by a GCN.Extensive experimental results on three benchmark datasets demonstrate the effectiveness of our approach.

Download Full-text

Graph Convolutional Networks with Bidirectional Attention for Aspect-Based Sentiment Classification

Applied Sciences ◽

10.3390/app11041528 ◽

2021 ◽

Vol 11 (4) ◽

pp. 1528

Author(s):

Jie Liu ◽

Peiyu Liu ◽

Zhenfang Zhu ◽

Xiaowen Li ◽

Guangtao Xu

Keyword(s):

Attention Mechanism ◽

Sentiment Classification ◽

Long Distance ◽

Convolutional Network ◽

Convolutional Networks ◽

Syntactic Information ◽

Position Coding ◽

Novel Method

Aspect-based sentiment classification aims at determining the corresponding sentiment of a particular aspect. Many sophisticated approaches, such as attention mechanisms and Graph Convolutional Networks, have been widely used to address this challenge. However, most of the previous methods have not well analyzed the role of words and long-distance dependencies, and the interaction between context and aspect terms is not well realized, which greatly limits the effectiveness of the model. In this paper, we propose an effective and novel method using attention mechanism and graph convolutional network (ATGCN). Firstly, we make full use of multi-head attention and point-wise convolution transformation to obtain the hidden state. Secondly, we introduce position coding in the model, and use Graph Convolutional Networks to obtain syntactic information and long-distance dependencies. Finally, the interaction between context and aspect terms is further realized by bidirectional attention. Experiments on three benchmarking collections indicate the effectiveness of ATGCN.

Download Full-text

Learn from Syntax: Improving Pair-wise Aspect and Opinion Terms Extraction with Rich Syntactic Knowledge

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/545 ◽

2021 ◽

Author(s):

Shengqiong Wu ◽

Hao Fei ◽

Yafeng Ren ◽

Donghong Ji ◽

Jingye Li

Keyword(s):

State Of The Art ◽

Boundary Detection ◽

High Order ◽

Experimental Results ◽

Convolutional Network ◽

Syntactic Knowledge ◽

Current State ◽

Syntactic Features ◽

Benchmark Datasets

In this paper, we propose to enhance the pair-wise aspect and opinion terms extraction (PAOTE) task by incorporating rich syntactic knowledge. We first build a syntax fusion encoder for encoding syntactic features, including a label-aware graph convolutional network (LAGCN) for modeling the dependency edges and labels, as well as the POS tags unifiedly, and a local-attention module encoding POS tags for better term boundary detection. During pairing, we then adopt Biaffine and Triaffine scoring for high-order aspect-opinion term pairing, in the meantime re-harnessing the syntax-enriched representations in LAGCN for syntactic-aware scoring. Experimental results on four benchmark datasets demonstrate that our model outperforms current state-of-the-art baselines, meanwhile yielding explainable predictions with syntactic knowledge.

Download Full-text

PolSAR Image Classification with Lightweight 3D Convolutional Networks

Remote Sensing ◽

10.3390/rs12030396 ◽

2020 ◽

Vol 12 (3) ◽

pp. 396

Author(s):

Hongwei Dong ◽

Lamei Zhang ◽

Bin Zou

Keyword(s):

Image Classification ◽

Data Representation ◽

Model Parameters ◽

Structure Information ◽

Convolutional Networks ◽

Optical Images ◽

Benchmark Datasets ◽

3D Cnn ◽

Almost All ◽

Fully Connected

Convolutional neural networks (CNNs) have become the state-of-the-art in optical image processing. Recently, CNNs have been used in polarimetric synthetic aperture radar (PolSAR) image classification and obtained promising results. Unlike optical images, the unique phase information of PolSAR data expresses the structure information of objects. This special data representation makes 3D convolution which explicitly modeling the relationship between polarimetric channels perform better in the task of PolSAR image classification. However, the development of deep 3D-CNNs will cause a huge number of model parameters and expensive computational costs, which not only leads to the decrease of the interpretation speed during testing, but also greatly increases the risk of over-fitting. To alleviate this problem, a lightweight 3D-CNN framework that compresses 3D-CNNs from two aspects is proposed in this paper. Lightweight convolution operations, i.e., pseudo-3D and 3D-depthwise separable convolutions, are considered as low-latency replacements for vanilla 3D convolution. Further, fully connected layers are replaced by global average pooling to reduce the number of model parameters so as to save the memory. Under the specific classification task, the proposed methods can reduce up to 69.83% of the model parameters in convolution layers of the 3D-CNN as well as almost all the model parameters in fully connected layers, which ensures the fast PolSAR interpretation. Experiments on three PolSAR benchmark datasets, i.e., AIRSAR Flevoland, ESAR Oberpfaffenhofen, EMISAR Foulum, show that the proposed lightweight architectures can not only maintain but also slightly improve the accuracy under various criteria.

Download Full-text

Encoding Text Information with Graph Convolutional Networks for Personality Recognition

Applied Sciences ◽

10.3390/app10124081 ◽

2020 ◽

Vol 10 (12) ◽

pp. 4081

Author(s):

Zhe Wang ◽

Chun-Hua Wu ◽

Qing-Biao Li ◽

Bo Yan ◽

Kang-Feng Zheng

Keyword(s):

Personality Traits ◽

Social Engineering ◽

Big Five Personality ◽

Convolutional Network ◽

Convolutional Networks ◽

Wide Range ◽

Benchmark Datasets ◽

Text Information ◽

Class Labels ◽

Personality Recognition

Personality recognition is a classic and important problem in social engineering. Due to the small number and particularity of personality recognition databases, only limited research has explored convolutional neural networks for this task. In this paper, we explore the use of graph convolutional network techniques for inferring a user’s personality traits from their Facebook status updates or essay information. Since the basic five personality traits (such as openness) and their aspects (such as status information) are related to a wide range of text features, this work takes the Big Five personality model as the core of the study. We construct a single user personality graph for the corpus based on user-document relations, document-word relations, and word co-occurrence and then learn the personality graph convolutional networks (personality GCN) for the user. The parameters or the inputs of our personality GCN are initialized with a one-hot representation for users, words and documents; then, under the supervision of users and documents with known class labels, it jointly learns the embeddings for users, words, and documents. We used feature information sharing to incorporate the correlation between the five personality traits into personality recognition to perfect the personality GCN. Our experimental results on two public and authoritative benchmark datasets show that the general personality GCN without any external word embeddings or knowledge is superior to the state-of-the-art methods for personality recognition. The personality GCN method is efficient on small datasets, and the average F1-score and accuracy of personality recognition are improved by up to approximately 3.6% and 2.4–2.57%, respectively.

Download Full-text

LCF: A Local Context Focus Mechanism for Aspect-Based Sentiment Classification

Applied Sciences ◽

10.3390/app9163389 ◽

2019 ◽

Vol 9 (16) ◽

pp. 3389 ◽

Cited By ~ 6

Author(s):

Biqing Zeng ◽

Heng Yang ◽

Ruyang Xu ◽

Wu Zhou ◽

Xuli Han

Keyword(s):

State Of The Art ◽

Sentiment Classification ◽

Experimental Results ◽

Local Context ◽

Baseline Model ◽

Global Context ◽

Benchmark Datasets ◽

Art Performance ◽

Context Features

Aspect-based sentiment classification (ABSC) aims to predict sentiment polarities of different aspects within sentences or documents. Many previous studies have been conducted to solve this problem, but previous works fail to notice the correlation between the aspect’s sentiment polarity and the local context. In this paper, a Local Context Focus (LCF) mechanism is proposed for aspect-based sentiment classification based on Multi-head Self-Attention (MHSA). This mechanism is called LCF design, and utilizes the Context features Dynamic Mask (CDM) and Context Features Dynamic Weighted (CDW) layers to pay more attention to the local context words. Moreover, a BERT-shared layer is adopted to LCF design to capture internal long-term dependencies of local context and global context. Experiments are conducted on three common ABSC datasets: the laptop and restaurant datasets of SemEval-2014 and the ACL twitter dataset. Experimental results demonstrate that the LCF baseline model achieves considerable performance. In addition, we conduct ablation experiments to prove the significance and effectiveness of LCF design. Especially, by incorporating with BERT-shared layer, the LCF-BERT model refreshes state-of-the-art performance on all three benchmark datasets.

Download Full-text

Augmenting Paraphrase Generation with Syntax Information Using Graph Convolutional Networks

Entropy ◽

10.3390/e23050566 ◽

2021 ◽

Vol 23 (5) ◽

pp. 566

Author(s):

Xiaoqiang Chi ◽

Yang Xiang

Keyword(s):

Language Processing ◽

Neural Nets ◽

Linguistic Knowledge ◽

Convolutional Network ◽

Tree Structures ◽

Convolutional Networks ◽

Sequence Modeling ◽

Syntactic Information ◽

Dependency Trees ◽

Paraphrase Generation

Paraphrase generation is an important yet challenging task in natural language processing. Neural network-based approaches have achieved remarkable success in sequence-to-sequence learning. Previous paraphrase generation work generally ignores syntactic information regardless of its availability, with the assumption that neural nets could learn such linguistic knowledge implicitly. In this work, we make an endeavor to probe into the efficacy of explicit syntactic information for the task of paraphrase generation. Syntactic information can appear in the form of dependency trees, which could be easily acquired from off-the-shelf syntactic parsers. Such tree structures could be conveniently encoded via graph convolutional networks to obtain more meaningful sentence representations, which could improve generated paraphrases. Through extensive experiments on four paraphrase datasets with different sizes and genres, we demonstrate the utility of syntactic information in neural paraphrase generation under the framework of sequence-to-sequence modeling. Specifically, our graph convolutional network-enhanced models consistently outperform their syntax-agnostic counterparts using multiple evaluation metrics.

Download Full-text