A Grid-Based Clustering Algorithm for High-Dimensional Data Streams

Advanced Data Mining and Applications - Lecture Notes in Computer Science ◽

10.1007/11527503_97 ◽

2005 ◽

pp. 824-831 ◽

Cited By ~ 10

Author(s):

Yansheng Lu ◽

Yufen Sun ◽

Guiping Xu ◽

Gang Liu

Keyword(s):

Data Streams ◽

Clustering Algorithm ◽

High Dimensional Data ◽

High Dimensional ◽

Grid Based

Download Full-text

A Grid-Based Subspace Clustering Algorithm for High-Dimensional Data Streams

Web Information Systems – WISE 2006 Workshops - Lecture Notes in Computer Science ◽

10.1007/11906070_4 ◽

2006 ◽

pp. 37-48 ◽

Cited By ~ 4

Author(s):

Yufen Sun ◽

Yansheng Lu

Keyword(s):

Data Streams ◽

Clustering Algorithm ◽

High Dimensional Data ◽

Subspace Clustering ◽

High Dimensional ◽

Grid Based

Download Full-text

Approximate Trace of Grid-Based Clusters over High Dimensional Data Streams

Advances in Knowledge Discovery and Data Mining - Lecture Notes in Computer Science ◽

10.1007/978-3-540-71701-0_82 ◽

2007 ◽

pp. 753-760

Author(s):

Nam Hun Park ◽

Won Suk Lee

Keyword(s):

Data Streams ◽

High Dimensional Data ◽

High Dimensional ◽

Grid Based

Download Full-text

A fast subspace partition clustering algorithm for high dimensional data streams

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems ◽

10.1109/icicisys.2009.5357796 ◽

2009 ◽

Cited By ~ 1

Author(s):

Zhongping Zhang ◽

Hao Wang

Keyword(s):

Data Streams ◽

Clustering Algorithm ◽

High Dimensional Data ◽

High Dimensional ◽

Subspace Partition ◽

Partition Clustering

Download Full-text

A Weighted Subspace Clustering Algorithm in High-Dimensional Data Streams

2009 Fourth International Conference on Innovative Computing, Information and Control (ICICIC) ◽

10.1109/icicic.2009.64 ◽

2009 ◽

Cited By ~ 1

Author(s):

Jiadong Ren ◽

Lining Li ◽

ChangZhen Hu

Keyword(s):

Data Streams ◽

Clustering Algorithm ◽

High Dimensional Data ◽

Subspace Clustering ◽

High Dimensional

Download Full-text

Irregular Grid-Based Clustering over High-Dimensional Data Streams

2010 First International Conference on Pervasive Computing, Signal Processing and Applications ◽

10.1109/pcspa.2010.195 ◽

2010 ◽

Author(s):

GuiBin Hou ◽

RuiXia Yao ◽

JiaDong Ren ◽

ChangZhen Hu

Keyword(s):

Data Streams ◽

High Dimensional Data ◽

High Dimensional ◽

Irregular Grid ◽

Grid Based

Download Full-text

A Fast Clustering Algorithm for Large-scale and High Dimensional Data

ACTA AUTOMATICA SINICA ◽

10.3724/sp.j.1004.2009.00859 ◽

2009 ◽

Vol 35 (7) ◽

pp. 859-866

Author(s):

Ming LIU ◽

Xiao-Long WANG ◽

Yuan-Chao LIU

Keyword(s):

Large Scale ◽

Clustering Algorithm ◽

High Dimensional Data ◽

High Dimensional

Download Full-text

A meta-heuristic density-based subspace clustering algorithm for high-dimensional data

Soft Computing ◽

10.1007/s00500-021-05973-1 ◽

2021 ◽

Author(s):

Parul Agarwal ◽

Shikha Mehta ◽

Ajith Abraham

Keyword(s):

Clustering Algorithm ◽

High Dimensional Data ◽

Subspace Clustering ◽

High Dimensional

Download Full-text

A two-stage online monitoring procedure for high-dimensional data streams

Journal of Quality Technology ◽

10.1080/00224065.2018.1507562 ◽

2018 ◽

Vol 51 (4) ◽

pp. 392-406 ◽

Cited By ~ 1

Author(s):

Jun Li

Keyword(s):

Data Streams ◽

Online Monitoring ◽

High Dimensional Data ◽

High Dimensional ◽

Two Stage ◽

Monitoring Procedure

Download Full-text

Scalable hierarchical clustering by composition rank vector encoding and tree structure

10.1101/2020.04.12.038026 ◽

2020 ◽

Author(s):

Xiao Lai ◽

Pu Tian

Keyword(s):

Machine Learning ◽

Hierarchical Clustering ◽

Clustering Algorithm ◽

High Dimensional Data ◽

Machine Learning Algorithms ◽

Tree Structure ◽

Supervised Machine Learning ◽

High Dimensional ◽

Rank Vector ◽

Nonlinear Correlations

AbstractSupervised machine learning, especially deep learning based on a wide variety of neural network architectures, have contributed tremendously to fields such as marketing, computer vision and natural language processing. However, development of un-supervised machine learning algorithms has been a bottleneck of artificial intelligence. Clustering is a fundamental unsupervised task in many different subjects. Unfortunately, no present algorithm is satisfactory for clustering of high dimensional data with strong nonlinear correlations. In this work, we propose a simple and highly efficient hierarchical clustering algorithm based on encoding by composition rank vectors and tree structure, and demonstrate its utility with clustering of protein structural domains. No record comparison, which is an expensive and essential common step to all present clustering algorithms, is involved. Consequently, it achieves linear time and space computational complexity hierarchical clustering, thus applicable to arbitrarily large datasets. The key factor in this algorithm is definition of composition, which is dependent upon physical nature of target data and therefore need to be constructed case by case. Nonetheless, the algorithm is general and applicable to any high dimensional data with strong nonlinear correlations. We hope this algorithm to inspire a rich research field of encoding based clustering well beyond composition rank vector trees.

Download Full-text

Fuzzy C Means Clustering Algorithm for High Dimensional Data Using Feature Subset Selection Technique

IOSR Journal of Computer Engineering ◽

10.9790/0661-16226469 ◽

2014 ◽

Vol 16 (2) ◽

pp. 64-69 ◽

Cited By ~ 1

Author(s):

N. Manjula ◽

◽

S. Pandiarajan ◽

J. Jagadeesan

Keyword(s):

Clustering Algorithm ◽

High Dimensional Data ◽

Subset Selection ◽

Feature Subset Selection ◽

High Dimensional ◽

Feature Subset ◽

Selection Technique ◽

Fuzzy C Means ◽

Fuzzy C Means Clustering

Download Full-text