An Indexing Method to Construct Unbalanced Layers for High-Dimensional Data in Mobile Environments

2017 · Vol 2017 · pp. 1-13
Author(s):  
Sun-Young Ihm ◽  
Jae-Hee Hur ◽  
Young-Ho Park

Top-k query processing is widely used in many applications and mobile environments. An index is used for efficient query processing, and layer-based indexing methods are representative approaches for performing top-k query processing efficiently. However, the existing methods suffer from high index building times on multidimensional, large data, which makes them difficult to use. In this paper, we propose a new concept for constructing a layer-based index, called the unbalanced layer (UB-Layer). The existing methods construct each layer as a balanced layer from the outermost data, wrapping the rest of the input data. In contrast, UB-Layer constructs each layer as an unbalanced layer that does not wrap the rest of the data. To construct a UB-Layer, we first divide the dimensions of the input data into divided-dimensional data and compute the convex hull of each divided-dimensional dataset. We then combine the divided convex hulls to build the UB-Layer. We also propose the UB-SelectAttribute algorithm, which divides the dimensions by major attributes. We demonstrate the superiority of the proposed methods through performance experiments.
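The construction described in the abstract can be sketched roughly as follows: split the dimensions into low-dimensional groups, compute a convex hull per group, and take the union of hull points as the first layer. This is a minimal illustrative sketch, not the authors' implementation; the helper names are hypothetical, and a 2-D monotone-chain hull is used for simplicity (the paper's dimension-division strategy, UB-SelectAttribute, is not reproduced here).

```python
def convex_hull_2d(points):
    """Andrew's monotone chain; returns hull vertices in order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]

def ub_first_layer(data, dim_groups):
    """data: list of d-dim tuples; dim_groups: e.g. [(0, 1), (2, 3)].
    Returns indices of tuples whose projection lies on the hull of
    at least one dimension group -- the first unbalanced layer."""
    layer = set()
    for group in dim_groups:
        proj = [tuple(row[i] for i in group) for row in data]
        hull = set(convex_hull_2d(proj))
        layer.update(i for i, p in enumerate(proj) if p in hull)
    return sorted(layer)
```

Because each group's hull covers only its own projection, the combined layer need not enclose the remaining points in the full space, which is what makes the layer "unbalanced."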

Computing · 2021
Author(s):  
Sun-Young Ihm ◽  
So-Hyun Park ◽  
Young-Ho Park

Cloud computing, in which data is distributed, stored, and managed remotely, is drawing attention as data generation and storage volumes increase. In addition, green computing, which increases energy efficiency, is also widely studied. An index is constructed to retrieve huge datasets efficiently, and layer-based indexing methods are widely used for efficient query processing. These methods construct a list of layers, so that only one layer is required for information retrieval instead of the entire dataset. The existing layer-based methods construct the layers using a convex hull algorithm. However, the execution time of this method is very high, especially on large, high-dimensional datasets. Furthermore, if the total number of layers increases, the query processing time also increases, resulting in efficient but slow query processing. In this paper, we propose an unbalanced-hierarchical layer method, which hierarchically divides the dimensions of the input data to increase the total number of layers and reduce the index building time. Through various experiments, we demonstrate that the proposed procedure significantly increases the total number of layers and reduces the index building time compared to existing methods.
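The "list of layers" that the existing convex-hull-based methods build can be sketched as classic convex-layer peeling: take the hull of the remaining points as one layer, remove it, and repeat. This is a minimal 2-D sketch of that baseline (not the proposed hierarchical method); the function names are illustrative.

```python
def convex_hull_2d(points):
    """Andrew's monotone chain; returns hull vertices in order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]

def peel_layers(points):
    """Convex-layer (onion) peeling: layer i is the hull of what
    remains after removing layers 0..i-1."""
    remaining = list(points)
    layers = []
    while remaining:
        hull = convex_hull_2d(remaining)
        layers.append(hull)
        hull_set = set(hull)
        remaining = [p for p in remaining if p not in hull_set]
    return layers
```

Each peeling step runs a full hull computation over the surviving points, which is why the build time grows quickly on large, high-dimensional inputs; the proposed method attacks exactly this cost by dividing the dimensions hierarchically.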


Author(s):  
N. Puviarasan ◽  
R. Bhavani

In content-based image retrieval (CBIR) applications, the idea of indexing is to map the descriptors extracted from images into a high-dimensional space. In this paper, visual features such as color, texture, and shape are considered. The color features are extracted using the color coherence vector (CCV), and the texture features are obtained from segmentation-based fractal texture analysis (SFTA). The shape features of an image are extracted using Fourier descriptors (FD), a contour-based feature extraction method. The color, texture, and shape features are then combined using appropriate weights, and a quadtree is used to index the images. It is experimentally found that the proposed quadtree-based indexing method gives better performance than other existing indexing methods.
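The indexing step can be illustrated with a minimal point-region quadtree: items are inserted under a 2-D key and retrieved with a range query. This is a sketch under stated assumptions, not the paper's implementation; in particular, reducing the combined color/texture/shape descriptor to a 2-D key is an illustrative simplification, and all names are hypothetical.

```python
class QuadTree:
    """Point-region quadtree: a node stores points until it exceeds
    `capacity`, then splits into four child quadrants."""

    def __init__(self, x0, y0, x1, y1, capacity=4):
        self.bounds = (x0, y0, x1, y1)
        self.capacity = capacity
        self.points = []          # (x, y, payload)
        self.children = None      # four sub-quadrants once split

    def insert(self, x, y, payload=None):
        x0, y0, x1, y1 = self.bounds
        if not (x0 <= x <= x1 and y0 <= y <= y1):
            return False          # point outside this quadrant
        if self.children is None:
            if len(self.points) < self.capacity:
                self.points.append((x, y, payload))
                return True
            self._split()
        return any(c.insert(x, y, payload) for c in self.children)

    def _split(self):
        x0, y0, x1, y1 = self.bounds
        mx, my = (x0 + x1) / 2, (y0 + y1) / 2
        self.children = [QuadTree(x0, y0, mx, my, self.capacity),
                         QuadTree(mx, y0, x1, my, self.capacity),
                         QuadTree(x0, my, mx, y1, self.capacity),
                         QuadTree(mx, my, x1, y1, self.capacity)]
        for px, py, pl in self.points:   # push stored points down
            any(c.insert(px, py, pl) for c in self.children)
        self.points = []

    def query(self, qx0, qy0, qx1, qy1):
        """Return all (x, y, payload) inside the query rectangle."""
        x0, y0, x1, y1 = self.bounds
        if qx1 < x0 or qx0 > x1 or qy1 < y0 or qy0 > y1:
            return []             # no overlap: prune this subtree
        hits = [(x, y, p) for x, y, p in self.points
                if qx0 <= x <= qx1 and qy0 <= y <= qy1]
        if self.children:
            for c in self.children:
                hits.extend(c.query(qx0, qy0, qx1, qy1))
        return hits
```

The pruning test in `query` is what makes the quadtree attractive for retrieval: subtrees whose bounds miss the query rectangle are skipped entirely.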


Author(s):  
Lixin Fu

In high-dimensional data sets, both the number of dimensions and the cardinalities of the dimensions are large, and the data is often very sparse; that is, most cube cells are empty. For such large data sets, efficiently computing the aggregation of a measure over arbitrary combinations of dimensions is a well-known, challenging problem. However, in real-world applications, users are usually not interested in all the sparse cubes, most of which are empty or contain only one or a few tuples. Instead, they focus more on the "big picture": the highly aggregated data, where the "where clauses" of the SQL queries involve only a few dimensions. Although the input data set is sparse, this aggregate data is dense. The existing multi-pass, full-cube computation algorithms are prohibitively slow for this type of application involving very large input data sets. We propose a new dynamic data structure called the Restricted Sparse Statistics Tree (RSST) and a novel cube evaluation algorithm, which are especially well suited for efficiently computing dense sub-cubes embedded in high-dimensional sparse data sets. RSST only computes the aggregations of non-empty cube cells where the number of non-star coordinates (i.e., the number of group-by attributes) is restricted to be no more than a user-specified threshold. Our algorithms are scalable and I/O-efficient. RSST is incrementally maintainable, which makes it suitable for data warehousing and the analysis of streaming data. We have compared our algorithms with state-of-the-art cube computation algorithms such as Dwarf and QCT in construction times, query response times, and data compression. Experiments demonstrate the excellent performance and good scalability of our approach.
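The restriction RSST applies can be illustrated in a few lines: aggregate only those cube cells whose number of non-star (group-by) coordinates is at most a threshold. A plain dictionary stands in for the tree in this sketch; the function name and row layout are illustrative assumptions, not the paper's data structure.

```python
from itertools import combinations

def restricted_cube(rows, n_dims, max_groupby):
    """rows: iterable of (dims_tuple, measure).
    Returns {(groupby_dims, groupby_values): aggregated_measure} for
    every cell with at most `max_groupby` non-star coordinates."""
    cube = {}
    for dims, measure in rows:
        # enumerate all group-by subsets of size 0..max_groupby
        for k in range(max_groupby + 1):
            for combo in combinations(range(n_dims), k):
                key = (combo, tuple(dims[i] for i in combo))
                cube[key] = cube.get(key, 0) + measure
    return cube
```

Since only cells with few group-by attributes are materialized, the result stays dense and small even when the full cube over a sparse, high-cardinality input would be astronomically large; updates are incremental because each incoming row simply adds into its affected cells.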


2015 · Vol 2015 · pp. 1-13
Author(s):  
Jiuwen Cao ◽  
Zhiping Lin

Extreme learning machine (ELM) has been developed for single-hidden-layer feedforward neural networks (SLFNs). In the ELM algorithm, the connections between the input layer and the hidden neurons are randomly assigned and remain unchanged during the learning process. The output connections are then tuned by minimizing the cost function through a linear system. The computational burden of ELM is thus significantly reduced, as the only cost is solving a linear system. This low computational complexity has attracted a great deal of attention from the research community, especially for high-dimensional and large-data applications. This paper provides an up-to-date survey of recent developments in ELM and its applications to high-dimensional and large data. Comprehensive reviews of image processing, video processing, medical signal processing, and other popular large-data applications with ELM are presented in the paper.
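The training procedure described above fits in a few lines of NumPy: the input-to-hidden weights are drawn at random and never updated, and the output weights are obtained in closed form by solving a linear least-squares problem via the pseudo-inverse. This is a minimal sketch of the standard ELM recipe, with tanh as an assumed activation; it is not any specific implementation surveyed in the paper.

```python
import numpy as np

def elm_train(X, Y, n_hidden, rng=None):
    """X: (n, d) inputs, Y: (n,) or (n, m) targets.
    Returns fixed random weights (W, b) and solved output weights."""
    rng = rng or np.random.default_rng(0)
    d = X.shape[1]
    W = rng.standard_normal((d, n_hidden))   # random, never updated
    b = rng.standard_normal(n_hidden)
    H = np.tanh(X @ W + b)                   # hidden-layer output matrix
    beta = np.linalg.pinv(H) @ Y             # least-squares output weights
    return W, b, beta

def elm_predict(X, W, b, beta):
    return np.tanh(X @ W + b) @ beta
```

The only expensive step is the pseudo-inverse of the n-by-n_hidden matrix H, which is what gives ELM its low training cost compared to iterative backpropagation.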

