Statistical Features for Extractive Automatic Text Summarization

Automatic Text Summarization (ATS) enables users to save their precious time to retrieve their relevant information need while searching voluminous big data. Text summaries are sensitive to scoring methods, as most of the methods requires to weight features for sentence scoring. In this chapter, various statistical features proposed by researchers for extractive automatic text summarization are explored. Features that perform well are termed as best features using ROUGE evaluation measures and used for creating feature combinations. After that, best performing feature combinations are identified. Performance evaluation of best performing feature combinations on short, medium and large size documents is also conducted using same ROUGE performance measures.

Prediction and Analysis of Extracting Relations using Spacy Model

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.f8524.038620 ◽

2020 ◽

Vol 8 (6) ◽

pp. 3281-3287

Keyword(s):

Natural Language ◽

Information Extraction ◽

Performance Measures ◽

Text Summarization ◽

Language Understanding ◽

Language Generation ◽

Structured Information ◽

Automatic Text ◽

F Measure

Text is an extremely rich resources of information. Each and every second, minutes, peoples are sending or receiving hundreds of millions of data. There are various tasks involved in NLP are machine learning, information extraction, information retrieval, automatic text summarization, question-answered system, parsing, sentiment analysis, natural language understanding and natural language generation. The information extraction is an important task which is used to find the structured information from unstructured or semi-structured text. The paper presents a methodology for extracting the relations of biomedical entities using spacy. The framework consists of following phases such as data creation, load and converting the data into spacy object, preprocessing, define the pattern and extract the relations. The dataset is downloaded from NCBI database which contains only the sentences. The created model evaluated with performance measures like precision, recall and f-measure. The model achieved 87% of accuracy in retrieving of entities relation.

Text Summarization

International Journal Of Engineering And Computer Science ◽

10.18535/ijecs/v9i2.4437 ◽

2020 ◽

Vol 9 (2) ◽

pp. 24940-24945

Author(s):

A. Vikas ◽

Pradyumna G.V.N ◽

Tahir Ahmed Shaik

Keyword(s):

Relevant Information ◽

Text Summarization ◽

The Internet ◽

Human Beings ◽

New Era ◽

Meaningful Information ◽

In this new era, where tremendous information is available on the internet, it is most important to provide the improved mechanism to extract the information quickly and most efficiently. It is very difficult for human beings to manually extract the summary of a large documents of text. There are plenty of text material available on the internet. So, there is a problem of searching for relevant documents from the number of documents available and absorbing relevant information from it. In order to solve the above two problems, the automatic text summarization is very much necessary. Text summarization is the process of identifying the most important meaningful information in a document or set of related documents and compressing them into a shorter version preserving its overall meanings.

A Multi-document Summarization System for News Articles in Portuguese using Integer Linear Programming

10.5753/eniac.2019.9320 ◽

2019 ◽

Author(s):

Laerth Gomes ◽

Hilário Oliveira

Keyword(s):

Linear Programming ◽

Integer Linear Programming ◽

Relevant Information ◽

Brazilian Portuguese ◽

Text Summarization ◽

Document Summarization ◽

Summarization System ◽

Automatic Text ◽

Intense Research

Automatic Text Summarization (ATS) has been demanding intense research in recent years. Its importance is given the fact that ATS systems can aid in the processing of large amounts of textual documents. The ATS task aims to create a summary of one or more documents by extracting their most relevant information. Despite the existence of several works, researches involving the development of ATS systems for documents written in Brazilian Portuguese are still a few. In this paper, we propose a multi-document summarization system following a concept-based approach using Integer Linear Programming for the generation of summaries from news articles written in Portuguese. Experiments using the CSTNews corpus were performed to evaluate different aspects of the proposed system. The experimental results obtained regarding the ROUGE measures demonstrate that the developed system presents encourage results, outperforming other works of the literature.

Advances in Data Mining and Database Management - Innovative Document Summarization Techniques ◽

Novel Text Summarization Techniques for Contextual Advertising

10.4018/978-1-4666-5019-0.ch008 ◽

2014 ◽

pp. 185-204

Author(s):

Giuliano Armano ◽

Alessandro Giuliani

Keyword(s):

Information Overload ◽

Relevant Information ◽

Text Summarization ◽

Experimental Results ◽

The Internet ◽

Continuous Growth ◽

Contextual Advertising ◽

Stored Information ◽

Recently, there has been a renewed interest on automatic text summarization techniques. The Internet has caused a continuous growth of information overload, focusing the attention on retrieval and filtering needs. Since digitally stored information is more and more available, users need suitable tools able to select, filter, and extract only relevant information. This chapter concentrates on studying and developing techniques for summarizing Webpages. In particular, the focus is the field of contextual advertising, the task of automatically suggesting ads within the content of a generic Webpage. Several novel text summarization techniques are proposed, comparing them with state of the art techniques and assessing whether the proposed techniques can be successfully applied to contextual advertising. Comparative experimental results are also reported and discussed. Results highlight the improvements of the proposals with respect to well-known text summarization techniques.

Novel Text Summarization Techniques for Contextual Advertising

Information Retrieval and Management ◽

10.4018/978-1-5225-5191-1.ch038 ◽

2018 ◽

pp. 883-903

Author(s):

Giuliano Armano ◽

Alessandro Giuliani

Keyword(s):

Information Overload ◽

Relevant Information ◽

Text Summarization ◽

Experimental Results ◽

The Internet ◽

Continuous Growth ◽

Contextual Advertising ◽

Stored Information ◽

Recently, there has been a renewed interest on automatic text summarization techniques. The Internet has caused a continuous growth of information overload, focusing the attention on retrieval and filtering needs. Since digitally stored information is more and more available, users need suitable tools able to select, filter, and extract only relevant information. This chapter concentrates on studying and developing techniques for summarizing Webpages. In particular, the focus is the field of contextual advertising, the task of automatically suggesting ads within the content of a generic Webpage. Several novel text summarization techniques are proposed, comparing them with state of the art techniques and assessing whether the proposed techniques can be successfully applied to contextual advertising. Comparative experimental results are also reported and discussed. Results highlight the improvements of the proposals with respect to well-known text summarization techniques.

Proceedings of the 2020 4th International Symposium on Computer Science and Intelligent Control ◽

Automatic Text Summarization on Social Media

10.1145/3440084.3441182 ◽

2020 ◽

Author(s):

Zhang Kerui ◽

Hu Haichao ◽

Liu Yuxia

Keyword(s):

Social Media ◽

Text Summarization ◽

Proceedings of the second ACM/IEEE-CS joint conference on Digital libraries - JCDL '02 ◽

Using librarian techniques in automatic text summarization for information retrieval

10.1145/544220.544227 ◽

2002 ◽

Cited By ~ 7

Author(s):

Min-Yen Kan ◽

Judith L. Klavans

Keyword(s):

Information Retrieval ◽

Text Summarization ◽

A Quantum-Inspired Genetic Algorithm for Extractive Text Summarization

International Journal of Natural Computing Research ◽

10.4018/ijncr.2021040103 ◽

2021 ◽

Vol 10 (2) ◽

pp. 42-60

Author(s):

Khadidja Chettah ◽

Amer Draa

Keyword(s):

Genetic Algorithm ◽

State Of The Art ◽

Text Summarization ◽

Automated System ◽

Evaluation Metrics ◽

Document Summarization ◽

Reference Methods ◽

Textual Data ◽

Automatic text summarization has recently become a key instrument for reducing the huge quantity of textual data. In this paper, the authors propose a quantum-inspired genetic algorithm (QGA) for extractive single-document summarization. The QGA is used inside a totally automated system as an optimizer to search for the best combination of sentences to be put in the final summary. The presented approach is compared with 11 reference methods including supervised and unsupervised summarization techniques. They have evaluated the performances of the proposed approach on the DUC 2001 and DUC 2002 datasets using the ROUGE-1 and ROUGE-2 evaluation metrics. The obtained results show that the proposal can compete with other state-of-the-art methods. It is ranked first out of 12, outperforming all other algorithms.

Advances in Artificial Intelligence - IBERAMIA 2018 - Lecture Notes in Computer Science ◽

Calculating the Upper Bounds for Portuguese Automatic Text Summarization Using Genetic Algorithm

10.1007/978-3-030-03928-8_36 ◽

2018 ◽

pp. 442-454 ◽

Cited By ~ 1

Author(s):

Jonathan Rojas-Simón ◽

Yulia Ledeneva ◽

René Arnulfo García-Hernández

Keyword(s):

Genetic Algorithm ◽

Upper Bounds ◽

Text Summarization ◽