A General Framework for Understanding Compressed Subspace Clustering Algorithms

Subspace clustering approaches cluster high dimensional data in different subspaces. It means grouping the data with different relevant subsets of dimensions. This technique has become very effective as a distance measure becomes ineffective in a high dimensional space. This chapter presents a novel evolutionary approach to a bottom up subspace clustering SUBSPACE_DE which is scalable to high dimensional data. SUBSPACE_DE uses a self-adaptive DBSCAN algorithm to perform clustering in data instances of each attribute and maximal subspaces. Self-adaptive DBSCAN clustering algorithms accept input from differential evolution algorithms. The proposed SUBSPACE_DE algorithm is tested on 14 datasets, both real and synthetic. It is compared with 11 existing subspace clustering algorithms. Evaluation metrics such as F1_Measure and accuracy are used. Performance analysis of the proposed algorithms is considerably better on a success rate ratio ranking in both accuracy and F1_Measure. SUBSPACE_DE also has potential scalability on high dimensional datasets.

Download Full-text

Improved Object Recognition with Decision Trees Using Subspace Clustering

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2016.p0041 ◽

2016 ◽

Vol 20 (1) ◽

pp. 41-48 ◽

Cited By ~ 1

Author(s):

Billy Peralta ◽

◽

Luis Alberto Caro

Keyword(s):

Object Recognition ◽

Decision Trees ◽

Clustering Algorithm ◽

Comprehensive Evaluation ◽

Recognition Performance ◽

Clustering Algorithms ◽

Subspace Clustering ◽

Visual Words ◽

Ensemble Techniques ◽

Standard Object

Generic object recognition algorithms usually require complex classificationmodels because of intrinsic difficulties arising from problems such as changes in pose, lighting conditions, or partial occlusions. Decision trees present an inexpensive alternative for classification tasks and offer the advantage of being simple to understand. On the other hand, a common scheme for object recognition is given by the appearances of visual words, also known as the bag-of-words method. Although multiple co-occurrences of visual words are more informative regarding visual classes, a comprehensive evaluation of such combinations is unfeasible because it would result in a combinatorial explosion. In this paper, we propose to obtain the multiple co-occurrences of visual words using a variant of the CLIQUE subspace-clustering algorithm for improving the object recognition performance of simple decision trees. Experiments on standard object datasets show that our method improves the accuracy of the classification of generic objects in comparison to traditional decision tree techniques that are similar, in terms of accuracy, to ensemble techniques. In future we plan to evaluate other variants of decision trees, and apply other subspace-clustering algorithms.

Download Full-text

Beyond linear subspace clustering: A comparative study of nonlinear manifold clustering algorithms

Computer Science Review ◽

10.1016/j.cosrev.2021.100435 ◽

2021 ◽

Vol 42 ◽

pp. 100435

Author(s):

Maryam Abdolali ◽

Nicolas Gillis

Keyword(s):

Comparative Study ◽

Linear Subspace ◽

Clustering Algorithms ◽

Subspace Clustering ◽

Manifold Clustering

Download Full-text

Clustering for High Dimensional Data: Density based Subspace Clustering Algorithms

International Journal of Computer Applications ◽

10.5120/10584-5732 ◽

2013 ◽

Vol 63 (20) ◽

pp. 29-35 ◽

Cited By ~ 1

Author(s):

Sunita Jahirabadkar ◽

Parag Kulkarni

Keyword(s):

Clustering Algorithms ◽

High Dimensional Data ◽

Subspace Clustering ◽

High Dimensional ◽

Data Density

Download Full-text

A Novel Scalable Signature Based Subspace Clustering Approach for Big Data

International Journal of Information Technology and Web Engineering ◽

10.4018/ijitwe.2019040103 ◽

2019 ◽

Vol 14 (2) ◽

pp. 41-51 ◽

Cited By ~ 1

Author(s):

T. Gayathri ◽

D. Lalitha Bhaskari

Keyword(s):

Big Data ◽

Data Management ◽

Clustering Algorithms ◽

Synthetic Data ◽

Subspace Clustering ◽

Distance Measures ◽

Data Sets ◽

Management Tools ◽

Clustering Approach ◽

Different Dimensions

“Big data” as the name suggests is a collection of large and complicated data sets which are usually hard to process with on-hand data management tools or other conventional processing applications. A scalable signature based subspace clustering approach is presented in this article that would avoid identification of redundant clusters. Various distance measures are utilized to perform experiments that validate the performance of the proposed algorithm. Also, for the same purpose of validation, the synthetic data sets that are chosen have different dimensions, and their size will be distributed when opened with Weka. The F1 quality measure and the runtime of these synthetic data sets are computed. The performance of the proposed algorithm is compared with other existing clustering algorithms such as CLIQUE.INSCY and SUNCLU.

Download Full-text

A nonconvex formulation for low rank subspace clustering: algorithms and convergence analysis

Computational Optimization and Applications ◽

10.1007/s10589-018-0002-6 ◽

2018 ◽

Vol 70 (2) ◽

pp. 395-418 ◽

Cited By ~ 5

Author(s):

Hao Jiang ◽

Daniel P. Robinson ◽

René Vidal ◽

Chong You

Keyword(s):

Convergence Analysis ◽

Clustering Algorithms ◽

Subspace Clustering ◽

Low Rank

Download Full-text

P-Splines Based Clustering as a General Framework: Some Applications Using Different Clustering Algorithms

Studies in Classification, Data Analysis, and Knowledge Organization - Classification, (Big) Data Analysis and Statistical Learning ◽

10.1007/978-3-319-55708-3_20 ◽

2018 ◽

pp. 183-190

Author(s):

Carmela Iorio ◽

Gianluca Frasso ◽

Antonio D’Ambrosio ◽

Roberta Siciliano

Keyword(s):

General Framework ◽

Clustering Algorithms

Download Full-text

A General Framework for Agglomerative Hierarchical Clustering Algorithms

18th International Conference on Pattern Recognition (ICPR'06) ◽

10.1109/icpr.2006.69 ◽

2006 ◽

Cited By ~ 19

Author(s):

R.J. Gil-Garcia ◽

J.M. Badia-Contelles ◽

A. Pons-Porrata

Keyword(s):

Hierarchical Clustering ◽

General Framework ◽

Clustering Algorithms ◽

Agglomerative Hierarchical Clustering

Download Full-text

Low-rank sparse subspace clustering with a clean dictionary

Journal of Algorithms & Computational Technology ◽

10.1177/1748302621999620 ◽

2021 ◽

Vol 15 ◽

pp. 174830262199962

Author(s):

Cong-Zhe You ◽

Zhen-Qiu Shu ◽

Hong-Hui Fan

Keyword(s):

Optimization Problem ◽

Clustering Algorithms ◽

Subspace Clustering ◽

Low Rank ◽

Data Matrix ◽

Image Clustering ◽

Convex Optimization Problem ◽

Data Points ◽

Low Rank Representation ◽

Sparse Subspace Clustering

Low-Rank Representation (LRR) and Sparse Subspace Clustering (SSC) are considered as the hot topics of subspace clustering algorithms. SSC induces the sparsity through minimizing the l1-norm of the data matrix while LRR promotes a low-rank structure through minimizing the nuclear norm. In this paper, considering the problem of fitting a union of subspace to a collection of data points drawn from one more subspaces and corrupted by noise, we pose this problem as a non-convex optimization problem, where the goal is to decompose the corrupted data matrix as the sum of a clean and self-expressive dictionary plus a matrix of noise. We propose a new algorithm, named Low-Rank and Sparse Subspace Clustering with a Clean dictionary (LRS2C2), by combining SSC and LRR, as the representation is often both sparse and low-rank. The effectiveness of the proposed algorithm is demonstrated through experiments on motion segmentation and image clustering.

Download Full-text

Performance Analysis of Subspace Clustering Algorithms in Biological Data

IJARCCE ◽

10.17148/ijarcce.2015.4259 ◽

2015 ◽

pp. 268-273

Author(s):

Shilpi Chakraborty ◽

Bijoyeta Roy

Keyword(s):

Performance Analysis ◽

Clustering Algorithms ◽

Subspace Clustering ◽

Biological Data

Download Full-text