Abstractive Summarization: A Survey of the State of the Art

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33019815 ◽

2019 ◽

Vol 33 ◽

pp. 9815-9822 ◽

Cited By ~ 5

Author(s):

Hui Lin ◽

Vincent Ng

Keyword(s):

Machine Translation ◽

State Of The Art ◽

The State ◽

Text Summarization ◽

Abstract Representation ◽

Automatic Text Summarization ◽

Input Text ◽

Gradual Shift ◽

Abstractive Summarization ◽

Automatic Text

The focus of automatic text summarization research has exhibited a gradual shift from extractive methods to abstractive methods in recent years, owing in part to advances in neural methods. Originally developed for machine translation, neural methods provide a viable framework for obtaining an abstract representation of the meaning of an input text and generating informative, fluent, and human-like summaries. This paper surveys existing approaches to abstractive summarization, focusing on the recently developed neural approaches.

Download Full-text

Algebraic reduction in automatic text summarization – the state of the art

International Conference on Computer and Communication Engineering (ICCCE'10) ◽

10.1109/iccce.2010.5556770 ◽

2010 ◽

Cited By ~ 2

Author(s):

Nowshath Kadhar Batcha ◽

Ahmed. M. Zaki

Keyword(s):

State Of The Art ◽

The State ◽

Text Summarization ◽

Automatic Text Summarization ◽

Algebraic Reduction ◽

Automatic Text

Download Full-text

A Quantum-Inspired Genetic Algorithm for Extractive Text Summarization

International Journal of Natural Computing Research ◽

10.4018/ijncr.2021040103 ◽

2021 ◽

Vol 10 (2) ◽

pp. 42-60

Author(s):

Khadidja Chettah ◽

Amer Draa

Keyword(s):

Genetic Algorithm ◽

State Of The Art ◽

Text Summarization ◽

Automated System ◽

Evaluation Metrics ◽

Document Summarization ◽

Automatic Text Summarization ◽

Reference Methods ◽

Textual Data ◽

Automatic Text

Automatic text summarization has recently become a key instrument for reducing the huge quantity of textual data. In this paper, the authors propose a quantum-inspired genetic algorithm (QGA) for extractive single-document summarization. The QGA is used inside a totally automated system as an optimizer to search for the best combination of sentences to be put in the final summary. The presented approach is compared with 11 reference methods including supervised and unsupervised summarization techniques. They have evaluated the performances of the proposed approach on the DUC 2001 and DUC 2002 datasets using the ROUGE-1 and ROUGE-2 evaluation metrics. The obtained results show that the proposal can compete with other state-of-the-art methods. It is ranked first out of 12, outperforming all other algorithms.

Download Full-text

A Pointer Generator Network Model to Automatic Text Summarization and Headline Generation

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.e1094.0785s319 ◽

2019 ◽

Vol 8 (5S3) ◽

pp. 447-451

Keyword(s):

Neural Network ◽

Network Model ◽

Recurrent Neural Network ◽

Text Summarization ◽

Daily Mail ◽

Automatic Text Summarization ◽

Generator Model ◽

Abstractive Summarization ◽

Automatic Text

In a world where information is growing rapidly every single day, we need tools to generate summary and headlines from text which is accurate as well as short and precise. In this paper, we have described a method for generating headlines from article. This is done by using hybrid pointer-generator network with attention distribution and coverage mechanism on article which generates abstractive summarization followed by the application of encoder-decoder recurrent neural network with LSTM unit to generate headlines from the summary. Hybrid pointer generator model helps in removing inaccuracy as well as repetitions. We have used CNN / Daily Mail as our dataset.

Download Full-text

SISTEM AUTOMATIC TEXT SUMMARIZATION MENGGUNAKAN ALGORITMA TEXTRANK

MATICS ◽

10.18860/mat.v12i2.8372 ◽

2020 ◽

Vol 12 (2) ◽

pp. 111-116

Author(s):

Muhammad Adib Zamzam

Keyword(s):

Text Summarization ◽

Automatic Text Summarization ◽

Unit Unit ◽

Abstractive Summarization ◽

Automatic Text

Text summarization (perangkuman teks) adalah pendekatan yang bisa digunakan untuk meringkas atau memadatkan teks artikel yang panjang menjadi lebih pendek dan ringkas sehingga hasil rangkuman teks yang relatif lebih pendek bisa mewakilkan teks yang panjang. Automatic Text Summarization adalah perangkuman teks yang dilakukan secara otomatis oleh komputer. Terdapat dua macam algoritma Automatic Text Summarization yaitu Extraction-based summarization dan Abstractive summarization. Algoritma TextRank merupakan algoritma extraction-based atau extractive, dimana ekstraksi di sini berarti memilih unit teks (kalimat, segmen-segmen kalimat, paragraf atau passages), lalu dianggap berisi informasi penting dari dokumen dan menyusun unit-unit (kalimat-kalimat) tersebut dengan cara yang benar. Hasil penelitian dengan input 50 artikel dan hasil rangkuman sebanyak 12,5% dari teks asli menunjukkan bahwa sistem memiliki nilai recall ROUGE 41,659 %. Nilai tertinggi recall ROUGE tertinggi tercatat pada artikel 48 dengan nilai 0,764. Nilai terendah recall ROUGE tercatat pada artikel 37 dengan nilai 0,167.

Download Full-text

Automatic Text Summarization: A State-of-the-Art Review

Proceedings of the 22nd International Conference on Enterprise Information Systems ◽

10.5220/0009723306480655 ◽

2020 ◽

Author(s):

Oleksandra Klymenko ◽

Daniel Braun ◽

Florian Matthes

Keyword(s):

State Of The Art ◽

Text Summarization ◽

Automatic Text Summarization ◽

Automatic Text

Download Full-text

Latent Semantic Analysis in Automatic Text Summarization: A state of the art analysis

International Journal of Intelligence and Sustainable Computing ◽

10.1504/ijisc.2020.10029282 ◽

2020 ◽

Vol 1 (1) ◽

pp. 1

Author(s):

Mehala N ◽

Tapas Guha

Keyword(s):

Latent Semantic Analysis ◽

Semantic Analysis ◽

State Of The Art ◽

Text Summarization ◽

Automatic Text Summarization ◽

Art Analysis ◽

Automatic Text

Download Full-text

MultiSumm: Towards a Unified Model for Multi-Lingual Abstractive Summarization

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i01.5328 ◽

2020 ◽

Vol 34 (01) ◽

pp. 11-18

Author(s):

Yue Cao ◽

Xiaojun Wan ◽

Jinge Yao ◽

Dian Yu

Keyword(s):

Language Model ◽

Text Summarization ◽

Additional Contribution ◽

Automatic Text Summarization ◽

Proposed Model ◽

Back Translation ◽

Model Training ◽

Abstractive Summarization ◽

Multiple Languages ◽

Automatic Text

Automatic text summarization aims at producing a shorter version of the input text that conveys the most important information. However, multi-lingual text summarization, where the goal is to process texts in multiple languages and output summaries in the corresponding languages with a single model, has been rarely studied. In this paper, we present MultiSumm, a novel multi-lingual model for abstractive summarization. The MultiSumm model uses the following training regime: (I) multi-lingual learning that contains language model training, auto-encoder training, translation and back-translation training, and (II) joint summary generation training. We conduct experiments on summarization datasets for five rich-resource languages: English, Chinese, French, Spanish, and German, as well as two low-resource languages: Bosnian and Croatian. Experimental results show that our proposed model significantly outperforms a multi-lingual baseline model. Specifically, our model achieves comparable or even better performance than models trained separately on each language. As an additional contribution, we construct the first summarization dataset for Bosnian and Croatian, containing 177,406 and 204,748 samples, respectively.

Download Full-text

Automatic Text Summarization by Providing Coverage, Non-Redundancy, and Novelty Using Sentence Graph

Journal of Information Technology Research ◽

10.4018/jitr.2022010108 ◽

2022 ◽

Vol 15 (1) ◽

pp. 1-18

Author(s):

Krishnaveni P. ◽

Balasundaram S. R.

Keyword(s):

Graph Algorithms ◽

Maximal Clique ◽

Text Summarization ◽

Original Text ◽

Online Information ◽

Automatic Text Summarization ◽

Global Properties ◽

Input Text ◽

Local Properties ◽

Automatic Text

The day-to-day growth of online information necessitates intensive research in automatic text summarization (ATS). The ATS software produces summary text by extracting important information from the original text. With the help of summaries, users can easily read and understand the documents of interest. Most of the approaches for ATS used only local properties of text. Moreover, the numerous properties make the sentence selection difficult and complicated. So this article uses a graph based summarization to utilize structural and global properties of text. It introduces maximal clique based sentence selection (MCBSS) algorithm to select important and non-redundant sentences that cover all concepts of the input text for summary. The MCBSS algorithm finds novel information using maximal cliques (MCs). The experimental results of recall oriented understudy for gisting evaluation (ROUGE) on Timeline dataset show that the proposed work outperforms the existing graph algorithms Bushy Path (BP), Aggregate Similarity (AS), and TextRank (TR).

Download Full-text

A Systematic Survey on Multi-document Text Summarization

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2021/111062021 ◽

2021 ◽

Vol 10 (6) ◽

pp. 3148-3153

Keyword(s):

Deep Learning ◽

Text Summarization ◽

Evaluation Metrics ◽

Automatic Process ◽

Document Summarization ◽

Text Document ◽

Automatic Text Summarization ◽

As Graph ◽

Abstractive Summarization ◽

Automatic Text

Automatic text summarization is a technique of generating short and accurate summary of a longer text document. Text summarization can be classified based on the number of input documents (single document and multi-document summarization) and based on the characteristics of the summary generated (extractive and abstractive summarization). Multi-document summarization is an automatic process of creating relevant, informative and concise summary from a cluster of related documents. This paper does a detailed survey on the existing literature on the various approaches for text summarization. Few of the most popular approaches such as graph based, cluster based and deep learning-based summarization techniques are discussed here along with the evaluation metrics, which can provide an insight to the future researchers.

Download Full-text

Multilingual Summarization Approaches

Advances in Data Mining and Database Management - Innovative Document Summarization Techniques ◽

10.4018/978-1-4666-5019-0.ch011 ◽

2014 ◽

pp. 257-276

Author(s):

Kamal Sarkar

Keyword(s):

Information Overload ◽

State Of The Art ◽

Text Summarization ◽

Automatic Text Summarization ◽

On Line ◽

Multilingual Text ◽

Automatic Text ◽

The Web

As the amount of on-line information in the languages other than English (such as Chinese, Japanese, German, French, Hindi, etc.) increases, systems that can automatically summarize multilingual documents are becoming increasingly desirable for managing information overload problem on the Web. This chapter presents an overview of automatic text summarization with special emphasis on multilingual text summarization. The various state-of-the-art multilingual summarization approaches have been grouped based on their characteristics and presented in this chapter.

Download Full-text