Preventing Overlaps in Agglomerative Hierarchical Conceptual Clustering

Author(s):  
Quentin Brabant ◽  
Amira Mouakher ◽  
Aurélie Bertaux
2004 ◽  
Vol 56 (1-3) ◽  
pp. 115-151 ◽  
Author(s):  
Nina Mishra ◽  
Dana Ron ◽  
Ram Swaminathan

Author(s):  
S. Ferilli ◽  
T. M. A. Basile ◽  
N. Di Mauro ◽  
M. Biba ◽  
F. Esposito

2020 ◽  
Vol 67 ◽  
pp. 509-547
Author(s):  
Maxime Chabert ◽  
Christine Solnon

We introduce the exactCover global constraint dedicated to the exact cover problem, the goal of which is to select subsets such that each element of a given set belongs to exactly one selected subset. This NP-complete problem occurs in many applications, and we more particularly focus on a conceptual clustering application. We introduce three propagation algorithms for exactCover, called Basic, DL, and DL+: Basic ensures the same level of consistency as arc consistency on a classical decomposition of exactCover into binary constraints, without using any specific data structure; DL ensures the same level of consistency as Basic but uses Dancing Links to efficiently maintain the relation between elements and subsets; and DL+ is a stronger propagator which exploits an extra property to filter more values than DL. We also consider the case where the number of selected subsets is constrained to be equal to a given integer variable k, and we show that this may be achieved either by combining exactCover with existing constraints, or by designing a specific propagator that integrates algorithms designed for the NValues constraint. These different propagators are experimentally evaluated on conceptual clustering problems, and they are compared with state-of-the-art declarative approaches. In particular, we show that our global constraint is competitive with recent ILP and CP models for mono-criterion problems, and it has better scale-up properties for multi-criteria problems.


Author(s):  
Floriana Esposito ◽  
Nicola Fanizzi ◽  
Claudia d’Amato

Author(s):  
R. Romero-Zaliz ◽  
C. Rubio-Escudero ◽  
O. Cordón ◽  
O. Harari ◽  
C. del Val ◽  
...  

Author(s):  
Leandro Krug Wives ◽  
José Palazzo Moreira de Oliveira ◽  
Stanley Loh

This chapter introduces a technique to cluster textual documents using concepts. Document clustering is a technique capable of organizing large amounts of documents in clusters of related information, which helps the localization of relevant information. Traditional document clustering techniques use words to represent the contents of the documents and the use of words may cause semantic mistakes. Concepts, instead, represent real world events and objects, and people employ them to express ideas, thoughts, opinions and intentions. Thus, concepts are more appropriate to represent the contents of a document and its use helps the comprehension of large document collections, since it is possible to summarize each cluster and rapidly identify its contents (i.e. concepts). To perform this task, the chapter presents a methodology to cluster documents using concepts and presents some practical experiments in a case study to demonstrate that the proposed approach achieves better results than the use of words.


Sign in / Sign up

Export Citation Format

Share Document