automatic document summarization
Recently Published Documents


TOTAL DOCUMENTS

22
(FIVE YEARS 4)

H-INDEX

4
(FIVE YEARS 0)

Author(s):  
I Made Widiartha ◽  
Rukmi Sari Hartati ◽  
Nyoman Putra Sastra ◽  
Dewa Made Wiharta

Information ◽  
2020 ◽  
Vol 11 (11) ◽  
pp. 536
Author(s):  
Heechan Kim ◽  
Soowon Lee

Automatic document summarization is a field of natural language processing that is rapidly improving with the development of end-to-end deep learning models. In this paper, we propose a novel summarization model that consists of three methods. The first is a coverage method based on noise injection that makes the attention mechanism select only important words by defining previous context information as noise. This alleviates the problem that the summarization model generates the same word sequence repeatedly. The second is a word association method to update the information of each word by comparing the information of the current step with the information of all previous decoding steps. According to following words, this catches a change in the meaning of the word that has been already decoded. The third is a method using a suppression loss function that explicitly minimizes the probabilities of non-answer words. The proposed summarization model showed good performance on some recall-oriented understudy for gisting evaluation (ROUGE) metrics compared to the state-of-the-art models in the CNN/Daily Mail summarization task, and the results were achieved with very few learning steps compared to the state-of-the-art models.


Document summarization is the process of generating the summary of the documents gathered from the web sources. It reduces the burden of web readers by reducing the necessity of reading the entire document contents by generating the short summary. In our previous research work this is performed by introducing the method namely Noun weight based Automated Multi-Document Summarization method (NW-AMDSM). However the previous research work doesn’t concentrate on the semantic similarity which might reduce the accuracy of the summarization outcome. This is resolved in the proposed research method by introducing the method namely Semantic Similarity based Automatic Document Summarization Method (SS-ADSM). In this research work, multi document grouping is done is based on semantic similarity computation, thus the document with similar contents can be grouped more accurately. Here the semantic similarity computation is performed with the help of word net analyzer. The document grouping is done by introducing the modified FCM clustering algorithm. Finally hybrid neuro fuzzy genetic algorithm is introduced to perform the automatic summarization. The numerical analysis of the proposed research method is conducted in the matlab simulation environment and compared with other research methods in terms various performance metrics. The simulation analysis proved proposed method tends to have better performance in terms of increased accuracy of document summarization outcome.


Sign in / Sign up

Export Citation Format

Share Document