Comparative Analysis of Neural Models for Abstractive Text Summarization

Author(s): Heena Kumari, Sunita Sarkar, Vikrant Rajput, Arindam Roy

Author(s): Towhid Ahmed Foysal, Mohaimen Abid Mahadi, Md. Mahadi Hasan Nahid, Ayesha Tasnim

2017, Vol 44 (2), pp. 139-147
Author(s): Kyeong-rim Kim, Da-yeong Lee, Hwan-Gue Cho

2020, Vol 8, pp. 264-280
Author(s): Sascha Rothe, Shashi Narayan, Aliaksei Severyn

Unsupervised pre-training of large neural models has recently revolutionized Natural Language Processing. By warm-starting from publicly released checkpoints, NLP practitioners have pushed the state of the art on multiple benchmarks while saving significant amounts of compute. So far, the focus has been mainly on Natural Language Understanding tasks. In this paper, we demonstrate the efficacy of pre-trained checkpoints for Sequence Generation. We develop a Transformer-based sequence-to-sequence model that is compatible with publicly available pre-trained BERT, GPT-2, and RoBERTa checkpoints, and conduct an extensive empirical study on the utility of initializing the model, both encoder and decoder, with these checkpoints. Our models achieve new state-of-the-art results on Machine Translation, Text Summarization, Sentence Splitting, and Sentence Fusion.
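The warm-starting idea above can be sketched as a simple weight-copy rule: every parameter whose name exists in the pretrained checkpoint is copied over, and the remainder (for example the decoder's cross-attention, which has no counterpart in BERT, GPT-2, or RoBERTa) is freshly initialized. The parameter names and shapes below are hypothetical, for illustration only, not the authors' actual code:

```python
import random

def warm_start(model_spec, checkpoint, seed=0):
    """Initialize a seq2seq parameter dict from a pretrained checkpoint.

    model_spec maps parameter names to their sizes; any name also present
    in the checkpoint is copied (warm start), the rest are drawn from a
    small Gaussian (fresh init), mirroring how encoder-decoder models are
    warm-started from encoder-only or decoder-only checkpoints.
    """
    rng = random.Random(seed)
    params = {}
    for name, size in model_spec.items():
        if name in checkpoint:
            params[name] = list(checkpoint[name])          # copied from checkpoint
        else:
            params[name] = [rng.gauss(0.0, 0.02) for _ in range(size)]  # fresh init
    return params

# Toy example: encoder self-attention exists in the checkpoint,
# decoder cross-attention does not and must be initialized from scratch.
checkpoint = {"encoder.self_attn.weight": [0.1, 0.2, 0.3]}
model_spec = {"encoder.self_attn.weight": 3, "decoder.cross_attn.weight": 3}
params = warm_start(model_spec, checkpoint)
```

In practice this pattern is what libraries such as Hugging Face Transformers implement when combining two pretrained checkpoints into one encoder-decoder model.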


2021, Vol 11 (9), pp. 3883
Author(s): Spyridon Kardakis, Isidoros Perikos, Foteini Grivokostopoulou, Ioannis Hatzilygeroudis

Attention-based methods for deep neural networks have attracted increasing interest in recent years. Attention mechanisms can focus on the important parts of a sequence and, as a result, enhance the performance of neural networks on a variety of tasks, including sentiment analysis, emotion recognition, machine translation, and speech recognition. In this work, we study attention-based models built on recurrent neural networks (RNNs) and examine their performance in various contexts of sentiment analysis. Self-attention, global-attention, and hierarchical-attention methods are examined under various deep neural models, training methods, and hyperparameters. Even though attention mechanisms are a powerful recent concept in deep learning, their effectiveness in sentiment analysis has yet to be thoroughly assessed. A comparative analysis is performed on a text sentiment classification task, where baseline models are compared with and without attention in every experiment. The experimental study also examines the proposed models' ability to recognize opinions and emotions in movie reviews. The results indicate that attention-based models yield clear gains over their non-attentive counterparts, improving accuracy by up to 3.5%.
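The core mechanism these models share can be illustrated with a minimal scaled dot-product attention step over a toy sequence of hidden states. This is a hedged sketch in plain Python (the vectors and dimensions are made up for illustration), not the paper's implementation:

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def dot_attention(query, keys, values):
    """Scaled dot-product attention for a single query vector.

    Scores each key against the query (scaled by sqrt of the key
    dimension), normalizes the scores with softmax, and returns the
    attention-weighted sum of the value vectors plus the weights.
    """
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    context = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return context, weights

# Toy example: three hidden states; the query aligns best with the first key,
# so that position should receive the largest attention weight.
keys = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
values = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
context, weights = dot_attention([1.0, 0.0], keys, values)
```

In an RNN-based sentiment classifier, the keys and values would be the recurrent hidden states and the resulting context vector would feed the classification layer; the softmax weights are what make the mechanism interpretable as "focus" over the sequence.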

