Text Summarizing Using NLP

2021 ◽  
Author(s):  
G. Vijay Kumar ◽  
Arvind Yadav ◽  
B. Vishnupriya ◽  
M. Naga Lahari ◽  
J. Smriti ◽  
...  

In this era where everything is digitalized, a vast amount of digital data is available on the internet for different purposes, and it is very hard to summarize this data manually. Automatic Text Summarization (ATS) is the natural next step: it can summarize source data and produce a short version that preserves the content and overall meaning. Although the concept of ATS dates back to the 1950s, the field is still striving to produce the best and most efficient summaries. ATS proceeds along two methods, extractive and abstractive summarization, and both have been refined to improve text summarization techniques. Text summarization is commonly implemented with NLP because of the packages and methods available in Python. Different approaches exist for summarizing text, along with several algorithms to implement them. TextRank is an unsupervised learning algorithm for extractive text summarization; it operates on weighted, undirected graphs and supports both keyword extraction and sentence extraction. So, in this paper, a model is built with the Gensim library in NLP to obtain better results in text summarization. This method better preserves the overall meaning of the text, so the person reading it can understand it more easily.
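
The following is a minimal sketch of the extractive TextRank summarization and keyword extraction referenced above, assuming Gensim 3.x, where the gensim.summarization module is available (it was removed in Gensim 4.0); the sample text and the ratio/word counts are illustrative, not taken from the paper.

```python
# Minimal sketch: TextRank-based extractive summarization with Gensim 3.x.
# The gensim.summarization module was removed in Gensim 4.0, so this assumes 3.x.
from gensim.summarization import summarize, keywords

text = (
    "Automatic text summarization condenses a long document into a short version. "
    "Extractive methods rank and select the most informative sentences. "
    "TextRank builds a weighted, undirected graph of sentences and scores them "
    "with a PageRank-style algorithm. "
    "The highest-ranked sentences are then concatenated to form the summary."
)

print(summarize(text, ratio=0.5))  # extractive summary: top-ranked sentences
print(keywords(text, words=5))     # top keywords, one per line
```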

In a world where information grows rapidly every single day, we need tools to generate summaries and headlines from text that are accurate as well as short and precise. In this paper, we describe a method for generating headlines from articles. A hybrid pointer-generator network with attention distribution and a coverage mechanism is applied to the article to produce an abstractive summary, followed by an encoder-decoder recurrent neural network with LSTM units that generates the headline from that summary. The hybrid pointer-generator model helps reduce inaccuracies as well as repetition. We use CNN/Daily Mail as our dataset.
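
Below is a minimal, hypothetical sketch of only the second stage described above: an encoder-decoder with LSTM units that maps a summary to a short headline. The class name, vocabulary size, and dimensions are illustrative assumptions, and the paper's hybrid pointer-generator and coverage components are not shown.

```python
import torch
import torch.nn as nn

class Seq2SeqHeadline(nn.Module):
    """Toy encoder-decoder with LSTM units for summary-to-headline generation."""
    def __init__(self, vocab_size=10000, emb_dim=128, hidden_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.encoder = nn.LSTM(emb_dim, hidden_dim, batch_first=True)
        self.decoder = nn.LSTM(emb_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, summary_ids, headline_ids):
        # Encode the summary; its final (h, c) state initialises the decoder.
        _, state = self.encoder(self.embed(summary_ids))
        dec_out, _ = self.decoder(self.embed(headline_ids), state)
        return self.out(dec_out)  # per-step logits over the vocabulary

# Toy usage with random token ids (batch of 2, summary length 30, headline length 8).
model = Seq2SeqHeadline()
logits = model(torch.randint(0, 10000, (2, 30)), torch.randint(0, 10000, (2, 8)))
print(logits.shape)  # torch.Size([2, 8, 10000])
```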


MATICS ◽  
2020 ◽  
Vol 12 (2) ◽  
pp. 111-116
Author(s):  
Muhammad Adib Zamzam

Text summarization is an approach that can be used to condense a long article into a shorter, more compact text, so that the relatively short summary can represent the long original. Automatic Text Summarization is text summarization performed automatically by a computer. There are two kinds of Automatic Text Summarization algorithms: extraction-based summarization and abstractive summarization. The TextRank algorithm is an extraction-based (extractive) algorithm, where extraction means selecting text units (sentences, sentence segments, paragraphs, or passages) that are considered to contain the document's important information and arranging those units (sentences) in the correct order. Experiments with 50 input articles and summaries of 12.5% of the original text show that the system achieved a ROUGE recall of 41.659%. The highest ROUGE recall was recorded for article 48 with a value of 0.764, and the lowest for article 37 with a value of 0.167.
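
As a quick illustration of the metric reported above, here is a minimal sketch of ROUGE-1 recall: the fraction of unigrams in the reference summary that also appear in the system summary. The function name and the whitespace tokenisation are assumptions for illustration; the paper does not specify its preprocessing.

```python
from collections import Counter

def rouge1_recall(reference: str, candidate: str) -> float:
    """ROUGE-1 recall: overlapping unigrams divided by unigrams in the reference."""
    ref_counts = Counter(reference.lower().split())
    cand_counts = Counter(candidate.lower().split())
    overlap = sum(min(n, cand_counts[tok]) for tok, n in ref_counts.items())
    total = sum(ref_counts.values())
    return overlap / total if total else 0.0

print(rouge1_recall("the cat sat on the mat", "the cat is on the mat"))  # ~0.833
```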


2020 ◽  
Vol 9 (2) ◽  
pp. 24940-24945
Author(s):  
A. Vikas ◽  
Pradyumna G.V.N ◽  
Tahir Ahmed Shaik

In this new era, where a tremendous amount of information is available on the internet, it is most important to provide improved mechanisms to extract information quickly and efficiently. It is very difficult for human beings to manually summarize large text documents. Plenty of text material is available on the internet, so there is the problem of searching for relevant documents among the many available and of absorbing the relevant information from them. To solve these two problems, automatic text summarization is essential. Text summarization is the process of identifying the most important, meaningful information in a document or set of related documents and compressing it into a shorter version while preserving its overall meaning.


2021 ◽  
pp. 101267
Author(s):  
Ángel Hernández-Castañeda ◽  
René Arnulfo García-Hernández ◽  
Yulia Ledeneva ◽  
Christian Eduardo Millán-Hernández

2020 ◽  
Vol 17 (9) ◽  
pp. 4368-4374
Author(s):  
Perpetua F. Noronha ◽  
Madhu Bhan

Digital data in huge amounts is being persistently generated at an unparalleled and exponential rate. In this digital era, where the internet is the prime source of an incredible amount of information, it is vital to develop better means to mine the available information rapidly and capably. Manual extraction of the salient information from large input text documents is a time-consuming and inefficient task. In this fast-moving world, it is difficult to read all the text content and derive insights from it; automatic methods are required. Probing for relevant documents among the large number of available sources, and consuming apt information from them, is a challenging task and the need of the hour. Automatic text summarization can be used to generate relevant, quality information in less time. Text summarization condenses the source text into a brief summary while maintaining its salient information and readability. Generating summaries automatically is in great demand, in order to cope with the growing amount of text data available online, to mark out the significant information, and to consume it faster. Text summarization is becoming extremely popular with advances in Natural Language Processing (NLP) and deep learning methods. The most important gain of automatic text summarization is that it reduces analysis time. In this paper we focus on key approaches to automatic text summarization and discuss their efficiency and limitations.


Author(s):  
Hui Lin ◽  
Vincent Ng

The focus of automatic text summarization research has exhibited a gradual shift from extractive methods to abstractive methods in recent years, owing in part to advances in neural methods. Originally developed for machine translation, neural methods provide a viable framework for obtaining an abstract representation of the meaning of an input text and generating informative, fluent, and human-like summaries. This paper surveys existing approaches to abstractive summarization, focusing on the recently developed neural approaches.


2020 ◽  
Vol 34 (01) ◽  
pp. 11-18
Author(s):  
Yue Cao ◽  
Xiaojun Wan ◽  
Jinge Yao ◽  
Dian Yu

Automatic text summarization aims at producing a shorter version of the input text that conveys the most important information. However, multi-lingual text summarization, where the goal is to process texts in multiple languages and output summaries in the corresponding languages with a single model, has been rarely studied. In this paper, we present MultiSumm, a novel multi-lingual model for abstractive summarization. The MultiSumm model uses the following training regime: (I) multi-lingual learning that contains language model training, auto-encoder training, translation and back-translation training, and (II) joint summary generation training. We conduct experiments on summarization datasets for five rich-resource languages: English, Chinese, French, Spanish, and German, as well as two low-resource languages: Bosnian and Croatian. Experimental results show that our proposed model significantly outperforms a multi-lingual baseline model. Specifically, our model achieves comparable or even better performance than models trained separately on each language. As an additional contribution, we construct the first summarization dataset for Bosnian and Croatian, containing 177,406 and 204,748 samples, respectively.

