A Multi-document Summarization System for News Articles in Portuguese using Integer Linear Programming

Automatic text summarization has recently become a key instrument for reducing the huge quantity of textual data. In this paper, the authors propose a quantum-inspired genetic algorithm (QGA) for extractive single-document summarization. The QGA is used inside a fully automated system as an optimizer that searches for the best combination of sentences to include in the final summary. The approach is compared with 11 reference methods, including supervised and unsupervised summarization techniques, on the DUC 2001 and DUC 2002 datasets using the ROUGE-1 and ROUGE-2 evaluation metrics. The results show that the proposal can compete with other state-of-the-art methods: it ranks first out of 12, outperforming all other algorithms.

Automatic Text Summarization of COVID-19 Research Articles Using Recurrent Neural Networks and Coreference Resolution

Frontiers in Biomedical Technologies ◽ 10.18502/fbt.v7i4.5321 ◽ 2021

Author(s): Mahsa Afsharizadeh ◽ Hossein Ebrahimpour-Komleh ◽ Ayoub Bagheri

Keyword(s): Neural Network ◽ Recurrent Neural Network ◽ Text Summarization ◽ Medical Community ◽ Research Articles ◽ Coreference Resolution ◽ Extractive Summarization ◽ Automatic Text Summarization ◽ Summarization System ◽ Automatic Text

Purpose: The COVID-19 pandemic has created an emergency for the medical community. Researchers must study the scientific literature extensively in order to discover drugs and vaccines. In a situation where every minute is valuable for saving the lives of hundreds of people, a quick understanding of scientific articles helps the medical community, and automatic text summarization makes this possible. Materials and Methods: This study proposes an extractive summarization method based on a recurrent neural network (RNN). Extractive methods identify the informative parts of the text, and RNNs are well suited to analyzing sequences such as text. The proposed method has three phases: sentence encoding, sentence ranking, and summary generation. To improve the performance of the summarization system, a coreference resolution procedure is used. Coreference resolution identifies the mentions in the text that refer to the same real-world entity, which helps the summarization process by revealing the central subject of the text. Results: The proposed method is evaluated on COVID-19 research articles extracted from the CORD-19 dataset. The results show that combining a recurrent neural network with coreference resolution embedding vectors improves the performance of the summarization system: the proposed method achieves a ROUGE-1 recall of 0.53. Conclusion: In this study, coreference information is stored in the form of coreference embedding vectors. The joint use of a recurrent neural network and coreference resolution results in an efficient summarization system.
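To make the reported metric concrete, ROUGE-1 recall is the fraction of the reference summary's unigrams that also appear in the system summary. The sketch below is a minimal illustration, not the evaluation code used in the study: the function name `rouge1_recall` is hypothetical, and lowercasing plus whitespace tokenization is an assumption (standard ROUGE tooling may also apply stemming or stopword handling).

```python
from collections import Counter


def rouge1_recall(candidate: str, reference: str) -> float:
    """Unigram recall: matched reference unigrams / total reference unigrams."""
    # Assumption: lowercase and split on whitespace; real ROUGE tooling tokenizes differently.
    cand_counts = Counter(candidate.lower().split())
    ref_counts = Counter(reference.lower().split())
    total = sum(ref_counts.values())
    if total == 0:
        return 0.0
    # Clipped overlap: a reference word is matched at most as often as it occurs in the candidate.
    overlap = sum(min(count, cand_counts[word]) for word, count in ref_counts.items())
    return overlap / total
```

Under this definition, a score of 0.53 would mean that roughly half of the reference summary's words are recovered by the system summary.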

Statistical Features for Extractive Automatic Text Summarization

Natural Language Processing ◽ 10.4018/978-1-7998-0951-7.ch030 ◽ 2020 ◽ pp. 619-637

Author(s): Yogesh Kumar Meena ◽ Dinesh Gopalani

Keyword(s): Big Data ◽ Performance Measures ◽ Relevant Information ◽ Text Summarization ◽ Statistical Features ◽ Information Need ◽ Evaluation Measures ◽ Large Size ◽ Automatic Text Summarization ◽ Automatic Text

Automatic Text Summarization (ATS) saves users time in retrieving the relevant information they need when searching voluminous big data. Text summaries are sensitive to the scoring method, as most methods require features to be weighted for sentence scoring. In this chapter, various statistical features proposed by researchers for extractive automatic text summarization are explored. Features that perform well under ROUGE evaluation measures are termed best features and are used to create feature combinations. The best-performing feature combinations are then identified, and their performance on short, medium, and large documents is evaluated using the same ROUGE measures.
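The idea of weighting statistical features for sentence scoring can be sketched as follows. This is only an illustration under stated assumptions: the three features (word frequency, sentence position, sentence length), the default weights, and the names `score_sentences` and `summarize` are hypothetical and are not taken from the chapter.

```python
from collections import Counter


def score_sentences(sentences, weights=(0.5, 0.3, 0.2)):
    """Score each sentence as a weighted sum of three illustrative features."""
    tokenized = [s.lower().split() for s in sentences]
    tf = Counter(word for tokens in tokenized for word in tokens)
    max_len = max((len(tokens) for tokens in tokenized), default=0)
    n = len(sentences)
    w_tf, w_pos, w_len = weights
    scores = []
    for i, tokens in enumerate(tokenized):
        if not tokens:
            scores.append(0.0)
            continue
        # Feature 1: mean document frequency of the sentence's words, scaled to [0, 1].
        tf_feat = sum(tf[w] for w in tokens) / (len(tokens) * max(tf.values()))
        # Feature 2: position, so earlier sentences score higher.
        pos_feat = 1.0 - i / n
        # Feature 3: length relative to the longest sentence.
        len_feat = len(tokens) / max_len
        scores.append(w_tf * tf_feat + w_pos * pos_feat + w_len * len_feat)
    return scores


def summarize(sentences, k=2, weights=(0.5, 0.3, 0.2)):
    """Return the k highest-scoring sentences in their original order."""
    scores = score_sentences(sentences, weights)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

Changing the weights changes which sentences are selected, which is why summaries are sensitive to the scoring method and why feature combinations are compared with ROUGE.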

Personalized Text Content Summarizer for Mobile Learning: An Automatic Text Summarization System with Relevance Based Language Model

2012 IEEE Fourth International Conference on Technology for Education ◽ 10.1109/t4e.2012.23 ◽ 2012 ◽ Cited By ~ 6

Author(s): Guangbing Yang ◽ Dunwei Wen ◽ Kinshuk ◽ Nian-Shing Chen ◽ Erkki Sutinen

Keyword(s): Mobile Learning ◽ Language Model ◽ Text Summarization ◽ Automatic Text Summarization ◽ Summarization System ◽ Text Content ◽ Automatic Text

Text Summarization

International Journal Of Engineering And Computer Science ◽ 10.18535/ijecs/v9i2.4437 ◽ 2020 ◽ Vol 9 (2) ◽ pp. 24940-24945

Author(s): A. Vikas ◽ Pradyumna G.V.N ◽ Tahir Ahmed Shaik

Keyword(s): Relevant Information ◽ Text Summarization ◽ The Internet ◽ Human Beings ◽ New Era ◽ Meaningful Information ◽ Automatic Text Summarization ◽ Automatic Text

In this new era, where a tremendous amount of information is available on the internet, it is important to provide improved mechanisms for extracting information quickly and efficiently. It is very difficult for human beings to manually summarize large text documents. Because plenty of text material is available on the internet, there are two problems: searching for relevant documents among the many available, and absorbing the relevant information from them. Automatic text summarization is needed to solve both problems. Text summarization is the process of identifying the most important, meaningful information in a document or a set of related documents and compressing it into a shorter version that preserves the overall meaning.

Statistical Features for Extractive Automatic Text Summarization

Advances in Business Information Systems and Analytics - Enterprise Big Data Engineering, Analytics, and Management ◽ 10.4018/978-1-5225-0293-7.ch008 ◽ 2016 ◽ pp. 126-144

Author(s): Yogesh Kumar Meena ◽ Dinesh Gopalani

Keyword(s): Big Data ◽ Performance Measures ◽ Relevant Information ◽ Text Summarization ◽ Statistical Features ◽ Information Need ◽ Evaluation Measures ◽ Large Size ◽ Automatic Text Summarization ◽ Automatic Text

Automatic Text Summarization (ATS) saves users time in retrieving the relevant information they need when searching voluminous big data. Text summaries are sensitive to the scoring method, as most methods require features to be weighted for sentence scoring. In this chapter, various statistical features proposed by researchers for extractive automatic text summarization are explored. Features that perform well under ROUGE evaluation measures are termed best features and are used to create feature combinations. The best-performing feature combinations are then identified, and their performance on short, medium, and large documents is evaluated using the same ROUGE measures.

Implementasi Algoritma Graf dan Algoritma Genetika pada Peringkasan Single Document (Implementation of Graph Algorithms and Genetic Algorithms for Single-Document Summarization)

Repositor ◽ 10.22219/repositor.v2i11.891 ◽ 2020 ◽ Vol 2 (11) ◽ pp. 1521

Author(s): Lina Dwi Yulianti ◽ Setio Basuki ◽ Yufis Azhar

Keyword(s): Genetic Algorithms ◽ Graph Algorithms ◽ System Development ◽ High Accuracy ◽ Text Summarization ◽ Test Results ◽ The Core ◽ Automatic Text Summarization ◽ Summarization System ◽ Automatic Text

With today's technological advances, finding information is easier and faster. However, a considerable amount of that information is untrue, commonly referred to as hoaxes, so information must be obtained from several sources to ensure its accuracy. An automatic text summarization system summarizes text-based documents. Such a system can help find the core of a news document, so that reading it does not require much time. The researchers use graph algorithms and genetic algorithms in developing the system. In testing, the summaries produced by the system and the manually produced summaries have a cosine similarity of 71.21%. The authors conclude that the system can be used by users because the tests yield high accuracy values.
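A cosine similarity between a system summary and a manual summary, as reported in this abstract, can be computed over word-count vectors. The sketch below is an assumption about the general technique, not the authors' implementation: the name `cosine_similarity`, the whitespace tokenization, and the use of raw term counts rather than, say, TF-IDF weights are all illustrative choices.

```python
import math
from collections import Counter


def cosine_similarity(text_a: str, text_b: str) -> float:
    """Cosine of the angle between the bag-of-words count vectors of two texts."""
    # Assumption: lowercase whitespace tokens with raw counts as vector components.
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    # Only words present in both texts contribute to the dot product.
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)
```

The value ranges from 0 (no shared words) to 1 (identical word distributions), so 71.21% corresponds to a similarity of about 0.71 on this scale.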

A novel automatic text summarization system with feature terms identification

2011 Annual IEEE India Conference ◽ 10.1109/indcon.2011.6139386 ◽ 2011 ◽ Cited By ~ 2

Author(s): Suneetha Manne ◽ Shaik Mohammed Zaheer Pervez ◽ S. Sameen Fatima

Keyword(s): Text Summarization ◽ Automatic Text Summarization ◽ Summarization System ◽ Automatic Text

Automatic text summarization system using a stochastic model

Machine Learning and Data Analysis ◽ 10.21469/22233792.4.4.04 ◽ 2018 ◽ Vol 4 (4) ◽ pp. 266-279

Author(s): Tamara Voznesenskaya

Keyword(s): Stochastic Model ◽ Text Summarization ◽ Automatic Text Summarization ◽ Summarization System ◽ Automatic Text

Survey of Scientific Document Summarization Techniques

Computer Science ◽ 10.7494/csci.2020.21.2.3356 ◽ 2020 ◽ Vol 21 (2)

Author(s): Sheena Kurian K ◽ Sheena Mathew

Keyword(s): Text Summarization ◽ Exponential Rate ◽ Research Papers ◽ Document Summarization ◽ Automatic Text Summarization ◽ Scientific Document Summarization ◽ Pros And Cons ◽ Comparison Of The Results ◽ Evaluation Techniques ◽ Automatic Text

The number of scientific or research papers published every year is growing at an exponential rate, which has led to intensive research in scientific document summarization. This paper discusses the methods commonly used in automatic text summarization, with their pros and cons. Commonly used evaluation techniques and datasets in this field are also discussed. ROUGE and Pyramid scores of the different methods are tabulated for easy comparison of the results.
