Automatic Abstracting: Latest Research Papers

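Narzędzia do automatycznego streszczania tekstów w języku polskim. Stan badań naukowych i prac wdrożeniowych [Tools for the automatic summarization of texts in Polish: the state of research and implementation work]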

e-mentor, 2021, Vol 89 (2), pp. 67-77. DOI: 10.15219/em88.1513

Author(s): Piotr Glenc

Keyword(s): Original Text, Text Documents, Automatic Text Summarization, Computer Scientists, Polish Language, Subject Areas, Automatic Abstracting, State Of Research, Automatic Text, Language Text

This publication presents the state of research and development work carried out in Poland on automatic text summarization. The author describes the principal theoretical and methodological issues of automatic summary generation and then outlines selected works on the automatic abstracting of Polish texts. The author also presents three IT tools that generate summaries of Polish texts (Summarize, Resoomer, and NICOLAS), characterizing them on the basis of an experiment that included a quality assessment of the generated summaries using ROUGE-N metrics. Both the literature review and the experiment revealed a shortage of tools for automatically summarizing Polish texts, especially tools that take the abstractive approach. Most of the proposed solutions rely on the extractive method, which builds the abstract from parts of the original text. There is also a shortage of tools that generate a single summary of multiple documents and of specialized tools that summarize documents from specific subject areas. Moreover, work on corpora of Polish-language text summaries needs to be intensified so that computer scientists can use them to evaluate newly developed tools.
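The ROUGE-N metrics mentioned in this abstract score a generated summary by its n-gram overlap with a reference summary. The following is a minimal sketch of that idea, not the evaluation code used in the study: the function names `ngrams` and `rouge_n`, the whitespace tokenization, and the lowercasing are all assumptions made for illustration. A real evaluation of Polish text would likely need proper tokenization and possibly lemmatization.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams (as tuples) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n(candidate, reference, n=1):
    """Return ROUGE-N precision, recall, and F1 for two strings.

    Assumption: tokens are lowercased, whitespace-separated words.
    """
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    # Clipped overlap: each n-gram counts at most as often as it
    # appears in both the candidate and the reference.
    overlap = sum((cand & ref).values())
    recall = overlap / sum(ref.values()) if ref else 0.0
    precision = overlap / sum(cand.values()) if cand else 0.0
    if precision + recall == 0:
        f1 = 0.0
    else:
        f1 = 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    generated = "the cat sat on the mat"
    reference = "the cat lay on the mat"
    print(rouge_n(generated, reference, n=1))
    print(rouge_n(generated, reference, n=2))
```

ROUGE-N is usually reported as recall, which asks how much of the reference summary the generated summary recovers. Precision and F1 are included here because recall alone rewards overly long summaries.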

Automatic Abstracting and Summarization

Encyclopedia of Library and Information Science, Fourth Edition, 2017, pp. 418-429. DOI: 10.1081/e-elis4-120008882

Keyword(s): Automatic Abstracting

On the composition of scientific abstracts

Journal of Documentation, 2016, Vol 72 (4), pp. 636-647. DOI: 10.1108/jdoc-09-2015-0111. Cited by: ~9

Author(s): Iana Atanassova, Marc Bertin, Vincent Larivière

Keyword(s): Design Methodology, Public Library, The Body, Scientific Article, Similarity Metrics, Rhetorical Structure, Content Type, Text Organization, Strong Relation, Automatic Abstracting

Purpose: Scientific abstracts reproduce only part of the information and the complexity of argumentation in a scientific article. This paper provides a first analysis of the similarity between the text of scientific abstracts and the body of articles, using sentences as the basic textual unit. It contributes to the understanding of the structure of abstracts. Design/methodology/approach: Using sentence-based similarity metrics, the authors quantify the phenomenon of text re-use in abstracts. Working with the introduction, methods, results and discussion structure, they examine the positions of body sentences that are similar to sentences in abstracts, using a corpus of over 85,000 research articles published in the seven Public Library of Science journals. Findings: The authors provide evidence that 84 percent of abstracts have at least one sentence in common with the body of the paper. Studying the distribution of the body sentences that are re-used in abstracts, the authors show that there is a strong relation between the rhetorical structure of articles and the zones that authors re-use when writing abstracts, with sentences coming mainly from the beginning of the introduction and the end of the conclusion. Originality/value: Scientific abstracts contain what the author(s) consider to be the information that best describes a document's content. This is the first study to examine the relation between the contents of abstracts and the rhetorical structure of scientific articles. The work might provide new insight for improving automatic abstracting tools as well as information retrieval approaches in which text organization and structure are important features.
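The sentence-level comparison described in this abstract can be illustrated with a small sketch. The abstract does not name the similarity metrics the authors used, so the Jaccard word-overlap measure, the 0.8 threshold, the regex sentence splitter, and the function names below are all assumptions chosen for illustration. For each abstract sentence, the sketch reports the relative position (0.0 = start, 1.0 = end) of the first sufficiently similar body sentence.

```python
import re


def split_sentences(text):
    """Naive sentence splitter: break after '.', '!' or '?' plus whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def jaccard(a, b):
    """Jaccard similarity between the lowercased word sets of two sentences."""
    wa = set(re.findall(r"\w+", a.lower()))
    wb = set(re.findall(r"\w+", b.lower()))
    union = wa | wb
    return len(wa & wb) / len(union) if union else 0.0


def reused_sentences(abstract, body, threshold=0.8):
    """Find abstract sentences that closely match a body sentence.

    Returns (abstract_sentence, relative_position) pairs, where
    relative_position is the matching body sentence's index scaled to
    the range 0.0-1.0. The threshold is a hypothetical cut-off.
    """
    body_sents = split_sentences(body)
    last_index = max(len(body_sents) - 1, 1)
    matches = []
    for sent in split_sentences(abstract):
        for idx, candidate in enumerate(body_sents):
            if jaccard(sent, candidate) >= threshold:
                matches.append((sent, idx / last_index))
                break  # keep only the first matching body sentence
    return matches


if __name__ == "__main__":
    abstract = "We study text re-use in abstracts. Results are new."
    body = ("Abstracts are short. We study text re-use in abstracts. "
            "Methods follow. The end.")
    for sentence, position in reused_sentences(abstract, body):
        print(f"{position:.2f}  {sentence}")
```

Aggregating these relative positions over a corpus would give a distribution of where re-used sentences sit in the article body, which is the kind of result the abstract reports.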

Research on Automatic Abstracting Methods Based on Sentences Clustering

Proceedings of the 9th International Symposium on Linear Drives for Industry Applications, Volume 4 - Lecture Notes in Electrical Engineering, 2013, pp. 405-410. DOI: 10.1007/978-3-642-40640-9_51

Author(s): Yang Luo

Keyword(s): Automatic Abstracting

Research of Extracting Chinese Automatic Abstracting Processing Based on Semidiscrete Matrix Decomposition

Applied Mechanics and Materials, 2012, Vol 241-244, pp. 3112-3115. DOI: 10.4028/www.scientific.net/amm.241-244.3112

Author(s): Yang Luo

Keyword(s): Matrix Decomposition, Automatic Abstracting

This paper introduces six popular methods of automatic abstracting. Then, building on latent semantic indexing (LSI), it proposes an approach to sentence extraction based on clustering sentences with semidiscrete decomposition (SDD).

An Improved Method for the Feature Extraction of Chinese Text by Combining Rough Set Theory with Automatic Abstracting Technology

Communications in Computer and Information Science - Contemporary Research on E-business Technology and Strategy, 2012, pp. 496-509. DOI: 10.1007/978-3-642-34447-3_44. Cited by: ~1

Author(s): Min Shen, Baosen Dong, Linying Xu

Keyword(s): Feature Extraction, Set Theory, Rough Set, Chinese Text, Rough Set Theory, Improved Method, Automatic Abstracting
