scholarly journals An Efficient Text Clustering Approach using Biased Affinity Propagation

2014 ◽  
Vol 96 (1) ◽  
pp. 1-4 ◽  
Author(s):  
Isha Sharma ◽  
Mahak Motwani
2020 ◽  
pp. 016555152091159
Author(s):  
Muhammad Qasim Memon ◽  
Yu Lu ◽  
Penghe Chen ◽  
Aasma Memon ◽  
Muhammad Salman Pathan ◽  
...  

Text segmentation (TS) is the process of dividing multi-topic text collections into cohesive segments using topic boundaries. Similarly, text clustering has been renowned as a major concern when it comes to multi-topic text collections, as they are distinguished by sub-topic structure and their contents are not associated with each other. Existing clustering approaches follow the TS method which relies on word frequencies and may not be suitable to cluster multi-topic text collections. In this work, we propose a new ensemble clustering approach (ECA) is a novel topic-modelling-based clustering approach, which induces the combination of TS and text clustering. We improvised a LDA-onto (LDA-ontology) is a TS-based model, which presents a deterioration of a document into segments (i.e. sub-documents), wherein each sub-document is associated with exactly one sub-topic. We deal with the problem of clustering when it comes to a document that is intrinsically related to various topics and its topical structure is missing. ECA is tested through well-known datasets in order to provide a comprehensive presentation and validation of clustering algorithms using LDA-onto. ECA exhibits the semantic relations of keywords in sub-documents and resultant clusters belong to original documents that they contain. Moreover, present research sheds the light on clustering performances and it indicates that there is no difference over performances (in terms of F-measure) when the number of topics changes. Our findings give above par results in order to analyse the problem of text clustering in a broader spectrum without applying dimension reduction techniques over high sparse data. Specifically, ECA provides an efficient and significant framework than the traditional and segment-based approach, such that achieved results are statistically significant with an average improvement of over 10.2%. For the most part, proposed framework can be evaluated in applications where meaningful data retrieval is useful, such as document summarization, text retrieval, novelty and topic detection.


2021 ◽  
Vol 20 (3) ◽  
pp. 288-298
Author(s):  
Famila Dwi Winati ◽  
Fauzan Romadlon

Bus Rapid Transit (BRT) is one of the alternative public transportations in urban areas, which has begun to be implemented in some cities of Indonesia. By finding out the effectiveness of BRT as a mass transportation system, it is necessary to study the expectations of users and non-users of the Trans Jateng Purwokerto-Purbalingga BRT regarding the perceived social, economic, and environmental impacts. This study uses the text Clustering method to group public opinion based on similarities so that it can be analyzed further for policymaking. As a result, the majority of the community gave positive expectations of BRT implementation’s perceived social, economic, and environmental benefits. On the other hand, public opinion on the presence of BRT is not always positive and has a significant impact. Improvements are needed in several aspects that are considered not to meet public expectations to maximize the function of BRT as a substitute for public transportation for private vehicles.


2011 ◽  
Vol 23 (4) ◽  
pp. 627-637 ◽  
Author(s):  
Renchu Guan ◽  
Xiaohu Shi ◽  
Maurizio Marchese ◽  
Chen Yang ◽  
Yanchun Liang

Sign in / Sign up

Export Citation Format

Share Document