On Cluster Extraction from Relational Data Using L1-Regularized Possibilistic Assignment Prototype Algorithm

This paper proposes entropy-based L1-regularized possibilistic clustering and a method of sequential cluster extraction from relational data. Sequential cluster extraction means that the algorithm extracts clusters one by one. The assignment prototype algorithm is a typical clustering method for relational data. The membership degree of each object to each cluster is calculated directly from dissimilarities between objects. An entropy-based L1-regularized possibilistic assignment prototype algorithm is first proposed to induce belongingness for a membership grade. An algorithm of sequential cluster extraction based on the proposed method is then constructed, and the effectiveness of the proposed methods is shown through numerical examples.

On Fuzzy Non-Metric Model for Data with Tolerance and its Application to Incomplete Data Clustering

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2016.p0571 ◽

2016 ◽

Vol 20 (4) ◽

pp. 571-579 ◽

Cited By ~ 1

Author(s):

Yasunori Endo ◽

Tomoyuki Suzuki ◽

Naohiko Kinoshita ◽

Yukihiro Hamasuna ◽

...

Keyword(s):

Data Clustering ◽

Incomplete Data ◽

Clustering Algorithm ◽

Uncertain Data ◽

Data Sets ◽

Membership Degree ◽

Clustering Methods ◽

Clustering Method ◽

Numerical Examples ◽

Metric Model

The fuzzy non-metric model (FNM) is a representative non-hierarchical clustering method. It is very useful because the belongingness, or membership degree, of each datum to each cluster can be calculated directly from the dissimilarities between data, and cluster centers are not used. However, the original FNM cannot handle data with uncertainty. In this study, we refer to data with uncertainty as “uncertain data,” e.g., incomplete data or data that have errors. Previously, a method was proposed based on the concept of a tolerance vector for handling uncertain data, and some clustering methods were constructed according to this concept, e.g., fuzzy c-means for data with tolerance. These methods can handle uncertain data in the framework of optimization. Thus, in the present study, we apply the concept to FNM. First, we propose a new clustering algorithm based on FNM using the concept of tolerance, which we refer to as the fuzzy non-metric model for data with tolerance. Second, we show that the proposed algorithm can handle incomplete data sets. Third, we verify the effectiveness of the proposed algorithm based on comparisons with conventional methods for incomplete data sets in some numerical examples.

On Entropy Based Fuzzy Non Metric Model – Proposal, Kernelization and Pairwise Constraints –

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2012.p0169 ◽

2012 ◽

Vol 16 (1) ◽

pp. 169-173 ◽

Cited By ~ 6

Author(s):

Yasunori Endo ◽

Keyword(s):

Kernel Functions ◽

Clustering Method ◽

Pairwise Constraints ◽

Numerical Examples ◽

Membership Grade ◽

Metric Model

The fuzzy non metric model is a kind of clustering method in which belongingness or the membership grade of each datum to each cluster is calculated directly from dissimilarities between data, and cluster centers are not used. In this paper, we first construct a new fuzzy non metric model with entropy regularization. Second, we kernelize the proposed method by introducing kernel functions. Third, we consider pairwise constraints with the proposed method. We then confirm the above methods through some simple numerical examples.
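
The idea that memberships come straight from dissimilarities, with no cluster centers, can be illustrated with a minimal sketch. It is not the paper's formulation. It assumes an objective of the form sum_i sum_k sum_t u_ki u_ti r_kt + lam * sum_i sum_k u_ki log u_ki with each datum's memberships summing to one, which leads to a softmax-style update (constant factors absorbed into lam). The names `update_memberships`, `entropy_fnm`, and `lam` are hypothetical.

```python
import numpy as np


def update_memberships(R, U, lam):
    """One softmax-style membership update computed from dissimilarities only.

    R   : (n, n) symmetric dissimilarity matrix between data
    U   : (n, c) current memberships, each row summing to 1
    lam : entropy-regularization weight (hypothetical name), lam > 0
    """
    # D[k, i] = sum_t R[k, t] * U[t, i]: membership-weighted dissimilarity
    # of datum k to cluster i; no cluster center is involved.
    D = np.asarray(R, dtype=float) @ np.asarray(U, dtype=float)
    # Subtract the row minimum before exponentiating for numerical stability.
    logits = -(D - D.min(axis=1, keepdims=True)) / lam
    W = np.exp(logits)
    return W / W.sum(axis=1, keepdims=True)


def entropy_fnm(R, n_clusters, lam=1.0, n_iter=100, tol=1e-8, seed=0):
    """Iterate the update from a random start until memberships stabilize."""
    R = np.asarray(R, dtype=float)
    rng = np.random.default_rng(seed)
    U = rng.random((R.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        U_new = update_memberships(R, U, lam)
        if np.abs(U_new - U).max() < tol:
            return U_new
        U = U_new
    return U
```

A smaller `lam` pushes memberships toward crisp 0/1 values, while a larger `lam` keeps them closer to uniform; this is the usual effect of entropy regularization rather than a result reported in the paper.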

Non Metric Model Based on Rough Set Representation

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2013.p0540 ◽

2013 ◽

Vol 17 (4) ◽

pp. 540-551 ◽

Cited By ~ 2

Author(s):

Yasunori Endo ◽

Ayako Heki ◽

Yukihiro Hamasuna ◽

...

Keyword(s):

Rough Set ◽

Fuzzy Set ◽

Rough Sets ◽

Clustering Algorithms ◽

Clustering Method ◽

Numerical Examples ◽

Upper Approximation ◽

Membership Grade ◽

Fuzzy Degree ◽

Metric Model

The non metric model is a kind of clustering method in which belongingness or the membership grade of each object in each cluster is calculated directly from dissimilarities between objects and in which cluster centers are not used. The clustering field has recently begun to focus on rough set representation instead of fuzzy set representation. Conventional clustering algorithms classify a set of objects into clusters with clear boundaries, that is, one object must belong to one cluster. Many objects in the real world, however, belong to more than one cluster because cluster boundaries overlap each other. Fuzzy set representation of clusters makes it possible for each object to belong to more than one cluster. The fuzzy degree of membership may, however, be too descriptive for interpreting clustering results. Rough set representation handles such cases. Clustering based on rough sets could provide a solution that is less restrictive than conventional clustering and more descriptive than fuzzy clustering. This paper covers two types of Rough-set-based Non Metric model (RNM). One algorithm is the Rough-set-based Hard Non Metric model (RHNM) and the other is the Rough-set-based Fuzzy Non Metric model (RFNM). In both algorithms, clusters are represented by rough sets and each cluster consists of a lower and an upper approximation. The effectiveness of the proposed algorithms is evaluated through numerical examples.

Fuzzy Co-Clustering Algorithms Based on Fuzzy Relational Clustering and TIBA Imputation

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2014.p0182 ◽

2014 ◽

Vol 18 (2) ◽

pp. 182-189 ◽

Cited By ~ 9

Author(s):

Yuchi Kanzawa ◽

Keyword(s):

Objective Function ◽

Clustering Algorithms ◽

Relational Data ◽

Clustering Method ◽

Numerical Examples ◽

Clustering Problem ◽

Relational Clustering

In this paper, two types of fuzzy co-clustering algorithms are proposed. First, it is shown that the base of the objective function for the conventional fuzzy co-clustering method is very similar to the base for the entropy-regularized fuzzy nonmetric model. Next, it is shown that the non-sense clustering problem in the conventional fuzzy co-clustering algorithms is identical to that in fuzzy nonmetric model algorithms, in the case that all dissimilarities among rows and columns are zero. Based on this discussion, a method is proposed that applies the entropy-regularized fuzzy nonmetric model after all dissimilarities among rows and columns are set to some values using a TIBA imputation technique. Furthermore, since relational fuzzy c-means is similar to the fuzzy nonmetric model, in the sense that both methods are designed for homogeneous relational data, a method is proposed that applies entropy-regularized relational fuzzy c-means after imputing all dissimilarities among rows and columns with TIBA. Some numerical examples are presented for the proposed methods.

Heuristic possibilistic clustering for detecting optimal number of elements in fuzzy clusters

Foundations of Computing and Decision Sciences ◽

10.1515/fcds-2016-0003 ◽

2016 ◽

Vol 41 (1) ◽

pp. 45-76 ◽

Cited By ~ 2

Author(s):

Dmitri A. Viattchenin

Keyword(s):

Optimal Number ◽

Experimental Results ◽

Numerical Examples ◽

Relational Clustering ◽

Possibilistic Clustering ◽

Controls Cluster

The paper deals with the problem of discovering fuzzy clusters with an optimal number of elements in heuristic possibilistic clustering. A relational clustering procedure using a parameter that controls cluster sizes is considered, and a technique for detecting the optimal number of elements in fuzzy clusters is proposed. The effectiveness of the proposed technique is illustrated through numerical examples. Experimental results are discussed and some preliminary conclusions are formulated.

Classification and Space Cluster for Visualizing GeoInformation

International Journal of Data Warehousing and Mining ◽

10.4018/ijdwm.2019010102 ◽

2019 ◽

Vol 15 (1) ◽

pp. 19-38

Author(s):

Toshihiro Osaragi

Keyword(s):

Spatial Data ◽

Spatial Clustering ◽

Effective Means ◽

Information Criterion ◽

Original Data ◽

Spatial Clusters ◽

Clustering Method ◽

Numerical Examples ◽

Loss Of Information ◽

Tokyo Metropolitan Area

It is necessary to classify numerical values of spatial data when representing them on a map so that, visually, they can be understood as clearly as possible. Inevitably, some loss of information from the original data occurs in the process of this classification. A great loss of information might lead to a misunderstanding of the nature of the original data. At the same time, when we try to understand the spatial distribution of attribute values, forming spatial clusters is regarded as an effective means, in which values can be regarded as statistically equivalent and are distributed continuously in the same patches. In this study, a classification method for organizing spatial data is proposed, in which any loss of information is minimized. Also, a spatial clustering method based on Akaike's Information Criterion is proposed. Some numerical examples of their applications are shown using actual spatial data for the Tokyo metropolitan area.

Fuzzified Even-Sized Clustering Based on Optimization

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2018.p0537 ◽

2018 ◽

Vol 22 (4) ◽

pp. 537-543 ◽

Cited By ~ 2

Author(s):

Kei Kitajima ◽

Yasunori Endo ◽

Yukihiro Hamasuna ◽

...

Keyword(s):

Data Analysis ◽

Fuzzy Clustering ◽

Cluster Size ◽

Clustering Algorithm ◽

Clustering Method ◽

Numerical Examples ◽

Fuzzy Clustering Method

Clustering is a method of data analysis without the use of supervised data. Even-sized clustering based on optimization (ECBO) is a clustering algorithm that focuses on cluster size, with the constraint that cluster sizes must be the same. However, this constraint makes ECBO inconvenient to apply in cases where a certain margin of cluster size is allowed. It is believed that this issue can be overcome by applying a fuzzy clustering method. Fuzzy clustering can represent the membership of data to clusters more flexibly. In this paper, we propose a new even-sized clustering algorithm based on fuzzy clustering and verify its effectiveness through numerical examples.

A Robust Automatic Merging Possibilistic Clustering Method

IEEE Transactions on Fuzzy Systems ◽

10.1109/tfuzz.2010.2077640 ◽

2011 ◽

Vol 19 (1) ◽

pp. 26-41 ◽

Cited By ~ 43

Author(s):

Miin-Shen Yang ◽

Chien-Yo Lai

Keyword(s):

Clustering Method ◽

Possibilistic Clustering

Image Segmentation Using The Enhanced Possibilistic Clustering Method

Information Technology Journal ◽

10.3923/itj.2007.541.546 ◽

2007 ◽

Vol 6 (4) ◽

pp. 541-546 ◽

Cited By ~ 5

Author(s):

Zhenping Xie ◽

Shitong Wang ◽

Dian You Zhang ◽

F.L. Chung ◽

Hanbin .

Keyword(s):

Image Segmentation ◽

Clustering Method ◽

Possibilistic Clustering

On Sequential Cluster Extraction Based on L1-Regularized Possibilistic c-Means

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2015.p0655 ◽

2015 ◽

Vol 19 (5) ◽

pp. 655-661 ◽

Cited By ~ 3

Author(s):

Yukihiro Hamasuna ◽

Yasunori Endo ◽

Keyword(s):

Rand Index ◽

Clustering Methods ◽

Number Of Clusters ◽

Numerical Examples ◽

Allocation Rules ◽

Possibilistic Clustering ◽

Extraction Algorithm ◽

The Relationship

Sequential cluster extraction algorithms are useful clustering methods that extract clusters one by one without the number of clusters having to be determined in advance. Typical examples of these algorithms are sequential hard c-means (SHCM) and possibilistic clustering (PCM) based algorithms. Two types of L1-regularized possibilistic clustering are proposed to induce crisp and possibilistic allocation rules and to construct a novel sequential cluster extraction algorithm. The relationship between the proposed method and SHCM is also discussed. The effectiveness of the proposed method is verified through numerical examples. Results show that the entropy-based method yields better results for the Rand Index and the number of extracted clusters.
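
The one-by-one extraction loop described above can be sketched generically as follows. This is an illustrative sketch, not the authors' algorithm: it assumes a crisp allocation rule in which a point joins the current cluster when its Euclidean distance to the cluster center is below a threshold. The function name `extract_clusters_sequentially`, the parameter `gamma`, and the choice of the first remaining point as the seed are all hypothetical.

```python
import numpy as np


def extract_clusters_sequentially(X, gamma, max_iter=100):
    """Extract clusters one at a time using a crisp distance threshold.

    X     : (n, d) array of data points
    gamma : hypothetical threshold (> 0); points closer than gamma to the
            current center are allocated to the cluster being extracted
    Returns an (n,) integer label array; labels count up from 0 in
    extraction order.
    """
    X = np.asarray(X, dtype=float)
    labels = np.full(len(X), -1)
    remaining = np.arange(len(X))
    cluster_id = 0
    while remaining.size > 0:
        pts = X[remaining]
        # Seed the cluster with the first remaining point (arbitrary choice).
        members = np.zeros(len(pts), dtype=bool)
        members[0] = True
        for _ in range(max_iter):
            center = pts[members].mean(axis=0)
            dist = np.linalg.norm(pts - center, axis=1)
            new_members = dist < gamma
            if not new_members.any():
                # Keep the previous member set rather than extracting nothing.
                break
            if np.array_equal(new_members, members):
                break
            members = new_members
        # Remove the extracted cluster and continue with what is left.
        labels[remaining[members]] = cluster_id
        remaining = remaining[~members]
        cluster_id += 1
    return labels
```

The number of clusters is not an input here; it falls out of how many extraction rounds are needed to exhaust the data, which depends on `gamma`.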
