automatic summarisation
Recently Published Documents


TOTAL DOCUMENTS

11
(FIVE YEARS 0)

H-INDEX

5
(FIVE YEARS 0)

2019 ◽  
Vol 25 (06) ◽  
pp. 735-751
Author(s):  
Constantin Orăsan

AbstractAutomatic text summarisation is a topic that has been receiving attention from the research community from the early days of computational linguistics, but it really took off around 25 years ago. This article presents the main developments from the last 25 years. It starts by defining what a summary is and how its definition changed over time as a result of the interest in processing new types of documents. The article continues with a brief history of the field and highlights the main challenges posed by the evaluation of summaries. The article finishes with some thoughts about the future of the field.



Author(s):  
Samira Lagrini ◽  
Nabiha Azizi ◽  
Monther Al Dwairi ◽  
Mohammed Redjimi


Author(s):  
Samira Lagrini ◽  
Nabiha Azizi ◽  
Mohammed Redjimi ◽  
Monther Al Dwairi


Corpora ◽  
2017 ◽  
Vol 12 (1) ◽  
pp. 23-54 ◽  
Author(s):  
Paula C.F. Cardoso ◽  
Thiago A.S. Pardo ◽  
Maite Taboada

Subtopic segmentation aims to break documents into subtopical text passages, which develop a main topic in a text. Being capable of automatically detecting subtopics is very useful for several Natural Language Processing applications. For instance, in automatic summarisation, having the subtopics at hand enables the production of summaries with good subtopic coverage. Given the usefulness of subtopic segmentation, it is common to assemble a reference-annotated corpus that supports the study of the envisioned phenomena and the development and evaluation of systems. In this paper, we describe the subtopic annotation process in a corpus of news texts written in Brazilian Portuguese, following a systematic annotation process and answering the main research questions when performing corpus annotation. Based on this corpus, we propose novel methods for subtopic segmentation following patterns of discourse organisation, specifically using Rhetorical Structure Theory. We show that discourse structures mirror the subtopic changes in news texts. An important outcome of this work is the freely available annotated corpus, which, to the best of our knowledge, is the only one for Portuguese. We demonstrate that some discourse knowledge may significantly help to find boundaries automatically in a text. In particular, the relation type and the level of the tree structure are important features.





2016 ◽  
Vol 67 ◽  
pp. 25-37 ◽  
Author(s):  
Hans Moen ◽  
Laura-Maria Peltonen ◽  
Juho Heimonen ◽  
Antti Airola ◽  
Tapio Pahikkala ◽  
...  


2010 ◽  
Vol 15 (8) ◽  
pp. 1505-1512 ◽  
Author(s):  
Pietro H. Guzzi ◽  
Maria Teresa Di Martino ◽  
Giuseppe Tradigo ◽  
Pierangelo Veltri ◽  
Pierfrancesco Tassone ◽  
...  


2010 ◽  
Vol 16 (2) ◽  
pp. 161-192 ◽  
Author(s):  
ALMER S. TIGELAAR ◽  
RIEKS OP DEN AKKER ◽  
DJOERD HIEMSTRA

AbstractWeb-based discussion fora proliferate on the Internet. These fora consist of threads about specific matters. Existing forum search facilities provide an easy way for finding threads of interest. However, understanding the content of threads is not always trivial. This problem becomes more pressing as threads become longer. It frustrates users that are looking for specific information and also makes it more difficult to make valuable contributions to a discussion. We postulate that having a concise summary of a thread would greatly help forum users. But, how would we best create such summaries? In this paper, we present an automated method of summarising threads in discussion fora. Compared with summarisation of unstructured texts and spoken dialogues, the structural characteristics of threads give important advantages. We studied how to best exploit these characteristics. Messages in threads contain both explicit and implicit references to each other and are structured. Therefore, we term the threads hierarchical dialogues. Our proposed summarisation algorithm produces one summary of an hierarchical dialogue by ‘cherry-picking’ sentences out of the original messages that make up a thread. We try to select sentences usable for obtaining an overview of the discussion. Our method is built around a set of heuristics based on observations of real fora discussions. The data used for this research was in Dutch, but the developed method equally applies to other languages. We evaluated our approach using a prototype. Users judged our summariser as very useful, half of them indicating they would use it regularly or always when visiting fora.



2007 ◽  
Vol 43 (1) ◽  
pp. 146-153 ◽  
Author(s):  
Shao Fen Liang ◽  
Siobhan Devlin ◽  
John Tait


Sign in / Sign up

Export Citation Format

Share Document