A Uighur Automatic Summarization Method Based on Sub-theme Division

Author(s):  
Xiaodong Yan
Author(s):  
Vladimir S. Simankov ◽  
Demid M. Tolkachev

The article deals with the relevant problem of automatic arbitrary text summary obtaining. Several works in this field are analyzed using comparison and classification methods. The problem of obtaining a text summary by means of an answer to a random question is emphasized. The identification of semantic relations between sentences using a set of rules based on syntax and semantics of a language is described. These rules are represented in the form of regular expressions – patterns that consist of characters and metacharacters and set search rules. Taking into account semantic coherence features, an improved method of sentences similarity calculation to identify the measure of inclusion of one sentence into another one is developed. This method helps to define more precisely logical stress on the words within automatic summarization and detect contradictions. A modified automatic summarization method, oriented at a specific problem is suggested. It is concluded that the proposed method is quiet effective in the process of automatic search for answers to questions in the Internet.


2011 ◽  
Vol 271-273 ◽  
pp. 154-157
Author(s):  
Xin Lai Tang ◽  
Xiao Rong Wang

This paper proposes a special Chinese automatic summarization method based on Concept-Obtained and Improved K-means Algorithm. The idea of our approach is to obtain concepts of words based on HowNet, and use concept as feature, instead of word. We use conceptual vector space model and Improved K-means Algorithm to form a summarization. Experimental results indicate a clear superiority of the proposed method over the traditional method under the proposed evaluation scheme.


2020 ◽  
Vol 13 (5) ◽  
pp. 977-986
Author(s):  
Srinivasa Rao Kongara ◽  
Dasika Sree Rama Chandra Murthy ◽  
Gangadhara Rao Kancherla

Background: Text summarization is the process of generating a short description of the entire document which is more difficult to read. This method provides a convenient way of extracting the most useful information and a short summary of the documents. In the existing research work, this is focused by introducing the Fuzzy Rule-based Automated Summarization Method (FRASM). Existing work tends to have various limitations which might limit its applicability to the various real-world applications. The existing method is only suitable for the single document summarization where various applications such as research industries tend to summarize information from multiple documents. Methods: This paper proposed Multi-document Automated Summarization Method (MDASM) to introduce the summarization framework which would result in the accurate summarized outcome from the multiple documents. In this work, multi-document summarization is performed whereas in the existing system only single document summarization was performed. Initially document clustering is performed using modified k means cluster algorithm to group the similar kind of documents that provides the same meaning. This is identified by measuring the frequent term measurement. After clustering, pre-processing is performed by introducing the Hybrid TF-IDF and Singular value decomposition technique which would eliminate the irrelevant content and would result in the required content. Then sentence measurement is one by introducing the additional metrics namely Title measurement in addition to the existing work metrics to accurately retrieve the sentences with more similarity. Finally, a fuzzy rule system is applied to perform text summarization. Results: The overall evaluation of the research work is conducted in the MatLab simulation environment from which it is proved that the proposed research method ensures the optimal outcome than the existing research method in terms of accurate summarization. MDASM produces 89.28% increased accuracy, 89.28% increased precision, 89.36% increased recall value and 70% increased the f-measure value which performs better than FRASM. Conclusion: The summarization processes carried out in this work provides the accurate summarized outcome.


Author(s):  
Dragomir R. Radev ◽  
Weiguo Fan

Sign in / Sign up

Export Citation Format

Share Document