Coupled term-term relation analysis for document clustering

Author(s):  
Xin Cheng ◽  
Duoqian Miao ◽  
Can Wang ◽  
Longbing Cao
2022 ◽  
Vol 24 (3) ◽  
pp. 0-0

A content-based recommender system is a subclass of information systems that recommends an item to a user based on the item's description. It suggests items such as news, documents, articles, webpages, and journals according to users' interests by comparing the key features of the items with the key terms or features of user interest profiles. This paper proposes a new methodology that uses Non-IIDness-based semantic term-term coupling, derived from the content referred to by users, to enhance recommendation results. In the proposed methodology, the semantic relationship is analyzed by estimating the explicit and implicit relationships between terms. It associates terms that are semantically related in the real world or are used interchangeably, such as synonyms. Term-term relation analysis enriches the underestimated features of user profiles, which improves the similarity estimation between relevant items and user profiles. The experimental results show that the proposed methodology improves overall search and retrieval results compared with state-of-the-art algorithms.
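
The abstract does not give the formulas behind the explicit and implicit term relationships, so the following is only an illustrative sketch under stated assumptions, not the authors' method. It assumes the explicit relation is the Jaccard overlap of the documents in which two terms co-occur, the implicit relation is the strongest path through a shared "link" term, and the two are mixed with a hypothetical weight `alpha`. All function names are invented for illustration.

```python
from itertools import combinations


def explicit_relation(docs):
    """Assumed explicit relation: Jaccard overlap of the document sets
    in which two terms occur. Returns (relation dict, sorted term list)."""
    occurs = {}
    for idx, doc in enumerate(docs):
        for term in set(doc):
            occurs.setdefault(term, set()).add(idx)
    rel = {}
    for a, b in combinations(sorted(occurs), 2):
        shared = occurs[a] & occurs[b]
        if shared:
            score = len(shared) / len(occurs[a] | occurs[b])
            rel[(a, b)] = rel[(b, a)] = score
    return rel, sorted(occurs)


def implicit_relation(rel, terms, a, b):
    """Assumed implicit relation: best path a -> link term -> b, where a
    path is only as strong as its weaker explicit edge."""
    return max(
        (min(rel.get((a, t), 0.0), rel.get((t, b), 0.0))
         for t in terms if t not in (a, b)),
        default=0.0,
    )


def coupled_relation(docs, a, b, alpha=0.5):
    """Hypothetical coupling: weighted mix of explicit and implicit scores."""
    rel, terms = explicit_relation(docs)
    explicit = rel.get((a, b), 0.0)
    implicit = implicit_relation(rel, terms, a, b)
    return alpha * explicit + (1 - alpha) * implicit


# Toy corpus: "car" and "automobile" never co-occur, but both occur with "engine".
docs = [["car", "engine"], ["automobile", "engine"], ["car", "wheel"]]
score = coupled_relation(docs, "car", "automobile")
```

In this toy corpus the two synonyms have no explicit relation, yet the shared term "engine" gives them a non-zero coupled score, which is the kind of association the abstract describes.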


2013 ◽  
Vol 26 (8) ◽  
pp. 693-698 ◽  
Author(s):  
Zhigang Zhang ◽  
Guixiang Zhang ◽  
Teng Liu ◽  
Cheng Qian ◽  
Yuanwang Deng

Author(s):  
Laith Mohammad Abualigah ◽  
Essam Said Hanandeh ◽  
Ahamad Tajudin Khader ◽  
Mohammed Abdallh Otair ◽  
Shishir Kumar Shandilya

Background: Given the increasing volume of text documents on Internet pages, handling such a large amount of information has become highly complex. Text clustering is commonly formulated as an optimization problem that organizes a large amount of text information into a set of comparable and coherent clusters. Aims: This paper presents a novel local clustering technique, namely β-hill climbing, which solves the text document clustering problem by partitioning similar documents into the same cluster. Methods: The β parameter is the primary innovation in the β-hill climbing technique. It is introduced to balance local and global search. Local search methods such as k-medoids and k-means have been successfully applied to the text document clustering problem. Results: Experiments were conducted on eight standard benchmark text datasets with different characteristics taken from the Laboratory of Computational Intelligence (LABIC). The results showed that the proposed β-hill climbing achieved better results than the original hill climbing technique in solving the text clustering problem. Conclusion: Adding the β operator to hill climbing improves text clustering performance.
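
The abstract gives no implementation details, so the following is a minimal sketch of how a β operator could be added to hill climbing for clustering, under stated assumptions. It assumes documents are already numeric vectors, the objective is the sum of squared distances to cluster centroids, the neighbourhood move reassigns one random document, and the β operator resets each assignment at random with probability `beta`. The function names and default values are hypothetical.

```python
import random


def cost(points, labels, k):
    """Sum of squared distances from each point to its cluster centroid."""
    dim = len(points[0])
    sums = [[0.0] * dim for _ in range(k)]
    counts = [0] * k
    for p, c in zip(points, labels):
        counts[c] += 1
        for j, v in enumerate(p):
            sums[c][j] += v
    total = 0.0
    for p, c in zip(points, labels):
        centroid = [s / counts[c] for s in sums[c]]
        total += sum((v - m) ** 2 for v, m in zip(p, centroid))
    return total


def beta_hill_climbing(points, k, beta=0.05, iterations=2000, seed=0):
    """Hill climbing with an assumed beta operator for random exploration."""
    rng = random.Random(seed)
    n = len(points)
    best = [rng.randrange(k) for _ in range(n)]
    best_cost = cost(points, best, k)
    for _ in range(iterations):
        cand = best[:]
        # Neighbourhood (local) move: reassign one random document.
        cand[rng.randrange(n)] = rng.randrange(k)
        # Beta (global) move: reset each assignment with probability beta.
        for i in range(n):
            if rng.random() < beta:
                cand[i] = rng.randrange(k)
        cand_cost = cost(points, cand, k)
        # Greedy acceptance: keep the candidate only if it is not worse.
        if cand_cost <= best_cost:
            best, best_cost = cand, cand_cost
    return best, best_cost


# Toy "document vectors": two loose groups in two dimensions.
pts = [(0.0, 0.1), (0.1, 0.0), (0.2, 0.1), (5.0, 5.1), (5.1, 5.0), (4.9, 5.2)]
labels, final_cost = beta_hill_climbing(pts, k=2)
```

Setting `beta=0` reduces this sketch to plain hill climbing with single-document moves, which makes the role of the extra operator easy to compare.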


Author(s):  
Ruina Bai ◽  
Ruizhang Huang ◽  
Yanping Chen ◽  
Yongbin Qin
