High-dimensional indexing algorithm based on the hyperplane tree-structure

Abstract: Supervised machine learning, especially deep learning based on a wide variety of neural network architectures, has contributed tremendously to fields such as marketing, computer vision and natural language processing. However, the development of unsupervised machine learning algorithms has been a bottleneck of artificial intelligence. Clustering is a fundamental unsupervised task in many different subjects. Unfortunately, no present algorithm is satisfactory for clustering high-dimensional data with strong nonlinear correlations. In this work, we propose a simple and highly efficient hierarchical clustering algorithm based on encoding by composition rank vectors and a tree structure, and demonstrate its utility by clustering protein structural domains. No record comparison, an expensive and essential step common to all present clustering algorithms, is involved. Consequently, the algorithm achieves hierarchical clustering with linear time and space complexity and is thus applicable to arbitrarily large datasets. The key factor in this algorithm is the definition of composition, which depends on the physical nature of the target data and therefore needs to be constructed case by case. Nonetheless, the algorithm is general and applicable to any high-dimensional data with strong nonlinear correlations. We hope this algorithm will inspire a rich research field of encoding-based clustering well beyond composition rank vector trees.
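The abstract does not spell out the encoding, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that a record is a sequence of symbols, that its "composition" is the count of each symbol, and that the rank vector is the list of symbols ordered by decreasing count. The names `composition_rank_vector` and `build_tree` are hypothetical. Each record is encoded once and filed under the prefixes of its rank vector, so no record is ever compared with another record.

```python
from collections import Counter, defaultdict


def composition_rank_vector(record, alphabet):
    """Order the alphabet symbols by how often they occur in the record.

    Assumption: "composition" means symbol counts. The real definition
    is data dependent and is not given in the abstract.
    """
    counts = Counter(record)
    order = {symbol: i for i, symbol in enumerate(alphabet)}
    # Most frequent symbol first; ties fall back to alphabet order so the
    # encoding is deterministic.
    return tuple(sorted(alphabet, key=lambda s: (-counts[s], order[s])))


def build_tree(records, alphabet, depth):
    """Group record names by rank-vector prefix, one dict per tree level.

    levels[k] maps a prefix of length k + 1 to the names of the records
    that share it. Every record is visited exactly once.
    """
    levels = [defaultdict(list) for _ in range(depth)]
    for name, record in records.items():
        vector = composition_rank_vector(record, alphabet)
        for k in range(depth):
            levels[k][vector[:k + 1]].append(name)
    return levels
```

Under these assumptions, records that share a longer prefix sit deeper in the same branch, which gives the hierarchy. The cost grows linearly with the number of records for a fixed alphabet and depth.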

An efficient high-dimensional indexing method for content-based retrieval in large image databases

Signal Processing Image Communication ◽

10.1016/j.image.2009.09.001 ◽

2009 ◽

Vol 24 (10) ◽

pp. 775-790 ◽

Cited By ~ 4

Author(s):

I. Daoudi ◽

K. Idrissi ◽

S.E. Ouatik ◽

A. Baskurt ◽

D. Aboutajdine

Keyword(s):

Image Databases ◽

High Dimensional ◽

Content Based Retrieval ◽

Indexing Method ◽

High Dimensional Indexing ◽

Large Image Databases

Efficient high-dimensional indexing by superimposing space-partitioning schemes

Proceedings. International Database Engineering and Applications Symposium, 2004. IDEAS '04. ◽

10.1109/ideas.2004.1319798 ◽

2004 ◽

Author(s):

J. Lukaszuk ◽

R. Orlandic

Keyword(s):

High Dimensional ◽

Space Partitioning ◽

High Dimensional Indexing

Extending High-Dimensional Indexing Techniques Pyramid and iMinMax(θ): Lessons Learned

Big Data - Lecture Notes in Computer Science ◽

10.1007/978-3-642-39467-6_23 ◽

2013 ◽

pp. 253-267 ◽

Cited By ~ 1

Author(s):

Karthik Ganesan Pillai ◽

Liessman Sturlaugson ◽

Juan M. Banda ◽

Rafal A. Angryk

Keyword(s):

Lessons Learned ◽

High Dimensional ◽

Indexing Techniques ◽

High Dimensional Indexing

Building High Dimensional Indexing Structure Based on Cluster Information for Better Space Utilization

Information Technology Journal ◽

10.3923/itj.2006.1038.1042 ◽

2006 ◽

Vol 5 (6) ◽

pp. 1038-1042

Author(s):

Sanjay Garg ◽

Ramesh Chandra Jain

Keyword(s):

High Dimensional ◽

Space Utilization ◽

Indexing Structure ◽

High Dimensional Indexing

High Dimensional Indexing

10.1007/springerreference_63841 ◽

2011 ◽

Keyword(s):

High Dimensional ◽

High Dimensional Indexing

Design by a CBIR System Supporting High Level Concepts

Interactive Multimedia Systems ◽

10.4018/978-1-931777-07-0.ch015 ◽

2011 ◽

pp. 259-268

Author(s):

M. V. Ramakrishna ◽

S. Nepal ◽

S. Sumanasekara ◽

S. M. M. Tahaghoghi

Keyword(s):

Similarity Measures ◽

Linear Mapping ◽

Content Based Image Retrieval ◽

High Dimensional ◽

Test Bed ◽

Development Activity ◽

Indexing Structure ◽

Level Data ◽

High Dimensional Indexing ◽

High Level

Content Based Image Retrieval (CBIR) systems that are able to “retrieve images of Clinton with Lewinsky” are unrealistic at present. However, this area has seen much research and development activity since IBM’s QBIC announcement in 1994. The CHITRA CBIR system, under development at RMIT and Monash Universities, addresses the need for a test bed system. Users can dynamically incorporate new features and similarity measures into the system, enabling it to act as a test bed for CBIR research. The system uses a 4-level data model we have developed and supports the definition and querying of high-level concepts such as MOUNTAIN and SUNSET. These advanced capabilities are supported by a powerful graphical query mechanism and a high-dimensional indexing structure based on linear mapping. In this paper we describe the design of the system and our contributions to the state of the art, and provide some implementation details.

An Efficient High-Dimensional Indexing Scheme Using a Clustering Technique for Content-Based Retrieval

2009 International Conference on Computational Science and Engineering ◽

10.1109/cse.2009.365 ◽

2009 ◽

Cited By ~ 1

Author(s):

Hyun-Jo Lee ◽

Hyeong-Il Kim ◽

Jae-Woo Chang

Keyword(s):

High Dimensional ◽

Content Based Retrieval ◽

Indexing Scheme ◽

Clustering Technique ◽

High Dimensional Indexing
