tree index Latest Research Papers

Support for region queries is crucial in geographic information systems, which process exact queries through spatial indexing to filter features and subsequently refine the selection. Although the filtering step has been extensively studied, the refinement step has received little attention. This research builds upon the QR−tree index, which decomposes space into hierarchical grids, registers features to the grids, and builds an R−tree for each grid, to develop a new QRB−tree index with two levels of optimization. In the first level, a bucket is introduced in every grid in the QR−tree index to accelerate the loading and search steps of a query region for the grids within the query region. In the second level, the number of candidate features to be eliminated is reduced by limiting the features to those registered to the grids covering the corners of the query region. Subsequently, an approach for determining the maximal grid level, which significantly affects the performance of the QR−tree index, is proposed. Direct comparisons of time costs with the QR−tree index and geohash index show that the QRB−tree index outperforms the other two approaches for rough queries in large query regions and exact queries in all cases.

Download Full-text

Efficient Indexing of Top-k Entities in Systems of Engagement with Extensions for Geo-tagged Entities

Data Science and Engineering ◽

10.1007/s41019-021-00173-1 ◽

2021 ◽

Author(s):

Anirban Mondal ◽

Ayaan Kakkar ◽

Nilesh Padhariya ◽

Mukesh Mohania

Keyword(s):

Response Times ◽

Recall Performance ◽

Single Parent ◽

Management Systems ◽

Enterprise Management ◽

Memory Consumption ◽

Tree Index ◽

Synthetic Datasets ◽

Inverted Indexes ◽

Candidate Set

AbstractNext-generation enterprise management systems are beginning to be developed based on the Systems of Engagement (SOE) model. We visualize an SOE as a set of entities. Each entity is modeled by a single parent document with dynamic embedded links (i.e., child documents) that contain multi-modal information about the entity from various networks. Since entities in an SOE are generally queried using keywords, our goal is to efficiently retrieve the top-k entities related to a given keyword-based query by considering the relevance scores of both their parent and child documents. Furthermore, we extend the afore-mentioned problem to incorporate the case where the entities are geo-tagged. The main contributions of this work are three-fold. First, it proposes an efficient bitmap-based approach for quickly identifying the candidate set of entities, whose parent documents contain all queried keywords. A variant of this approach is also proposed to reduce memory consumption by exploiting skews in keyword popularity. Second, it proposes the two-tier HI-tree index, which uses both hashing and inverted indexes, for efficient document relevance score lookups. Third, it proposes an R-tree-based approach to extend the afore-mentioned approaches for the case where the entities are geo-tagged. Fourth, it performs comprehensive experiments with both real and synthetic datasets to demonstrate that our proposed schemes are indeed effective in providing good top-k result recall performance within acceptable query response times.

Download Full-text

Managing Sparse Spatio-Temporal Data in SAVIME: an Evaluation of the Ph-tree Index

10.5753/sbbd.2021.17895 ◽

2021 ◽

Author(s):

Stiw Herrera ◽

Larissa Miguez da Silva ◽

Paulo Ricardo Reis ◽

Anderson Silva ◽

Fabio Porto

Keyword(s):

Sparse Data ◽

Scientific Data ◽

Efficient Implementation ◽

Temporal Data ◽

Indexing Structure ◽

Tree Index ◽

Spatio Temporal ◽

Data Ingestion ◽

Memory Indexing ◽

Array Databases

Scientific data is mainly multidimensional in its nature, presenting interesting opportunities for optimizations when managed by array databases. However, in scenarios where data is sparse, an efficient implementation is still required. In this paper, we investigate the adoption of the Ph-tree as an in-memory indexing structure for sparse data. We compare the performance in data ingestion and in both range and punctual queries, using SAVIME as the multidimensional array DBMS. Our experiments, using a real weather dataset, highlights the challenges involving providing a fast data ingestion, as proposed by SAVIME, and at the same time efficiently answering multidimensional queries on sparse data.

Download Full-text

Time Series Geographic Social Network Dynamic Preference Group Query

International Journal of Information Systems in the Service Sector ◽

10.4018/ijisss.2021100102 ◽

2021 ◽

Vol 13 (4) ◽

pp. 18-39

Author(s):

Yinglian Zhou ◽

Jifeng Chen

Keyword(s):

Time Series ◽

Social Network ◽

Social Impact ◽

Network Models ◽

User Preferences ◽

Network Dynamic ◽

Tree Index ◽

Query Algorithm ◽

Value Model ◽

Dynamic Preferences

Driven by experience and social impact of the new life, user preferences continue to change over time. In order to make up for the shortcomings of existing geographic social network models that often cannot obtain user dynamic preferences, a time-series geographic social network model was constructed to detect user dynamic preferences, a dynamic preference value model was built for user dynamic preference evaluation, and a dynamic preferences group query (DPG) was proposed in this paper . In order to optimize the efficiency of the DPG query algorithm, the UTC-tree index user timing check-in record is designed. UTC-tree avoids traversing all user check-in records in the query, accelerating user dynamic preference evaluation. Finally, the DPG query algorithm is used to implement a well-interacted DPG query system. Through a large number of comparative experiments, the validity of UTC-tree and the scalability of DPG query are verified.

Download Full-text

Cracking in-memory database index: A case study for Adaptive Radix Tree index

Information Systems ◽

10.1016/j.is.2021.101913 ◽

2021 ◽

pp. 101913

Author(s):

Gang Wu ◽

Yidong Song ◽

Guodong Zhao ◽

Wei Sun ◽

Donghong Han ◽

...

Keyword(s):

Database Index ◽

Tree Index

Download Full-text

Technical Perspective

ACM SIGMOD Record ◽

10.1145/3471485.3471495 ◽

2021 ◽

Vol 50 (1) ◽

pp. 41-41

Author(s):

Qin Zhang

Keyword(s):

Input Data ◽

Database System ◽

Fast Search ◽

Query Result ◽

Tree Index ◽

The Given

One of the most important functionalities of a database system is to answer queries. We are interested in the following question: If there exists more than one answer to the given query, which one should the database report? There are two apparent choices: to return all the valid answers or to return one of them. The problem with the former choice is that it is often time-prohibitive to search for all valid answers. In the latter choice, fairness may become an issue, since the index built for fast search may introduce bias to the query result. For example, the index may favor a certain portion of the input data (e.g., nodes near the root of a tree index) and with a higher chance, output an answer related to that portion than other portions. Such bias can sometimes lead to undesirable consequences.

Download Full-text

TLBtree: A Read/Write-Optimized Tree Index for Non-Volatile Memory

2021 IEEE 37th International Conference on Data Engineering (ICDE) ◽

10.1109/icde51399.2021.00172 ◽

2021 ◽

Author(s):

Yongping Luo ◽

Peiquan Jin ◽

Qinglin Zhang ◽

Bin Cheng

Keyword(s):

Non Volatile Memory ◽

Tree Index ◽

Volatile Memory

Download Full-text

Updatable learned index with precise positions

Proceedings of the VLDB Endowment ◽

10.14778/3457390.3457393 ◽

2021 ◽

Vol 14 (8) ◽

pp. 1276-1288

Author(s):

Jiacheng Wu ◽

Yong Zhang ◽

Shimin Chen ◽

Jin Wang ◽

Yu Chen ◽

...

Keyword(s):

State Of The Art ◽

Real Life ◽

Index Structures ◽

Dynamic Adjustment ◽

New Paradigm ◽

Adjustment Strategy ◽

Tree Index ◽

Synthetic Datasets ◽

The Cost ◽

New Framework

Index plays an essential role in modern database engines to accelerate the query processing. The new paradigm of "learned index" has significantly changed the way of designing index structures in DBMS. The key insight is that indexes could be regarded as learned models that predict the position of a lookup key in the dataset. While such studies show promising results in both lookup time and index size, they cannot efficiently support update operations. Although recent studies have proposed some preliminary approaches to support update, they are at the cost of scarifying the lookup performance as they suffer from the overheads brought by imprecise predictions in the leaf nodes. In this paper, we propose LIPP, a brand new framework of learned index to address such issues. Similar with state-of-the-art learned index structures, LIPP is able to support all kinds of index operations, namely lookup query, range query, insert, delete, update and bulkload. Meanwhile, we overcome the limitations of previous studies by properly extending the tree structure when dealing with update operations so as to eliminate the deviation of location predicted by the models in the leaf nodes. Moreover, we further propose a dynamic adjustment strategy to ensure that the height of the tree index is tightly bounded and provide comprehensive theoretical analysis to illustrate it. We conduct an extensive set of experiments on several real-life and synthetic datasets. The results demonstrate that our method consistently outperforms state-of-the-art solutions, achieving by up to 4X for a broader class of workloads with different index operations.

Download Full-text

An improved data integrity validation model for cloud storage

MATEC Web of Conferences ◽

10.1051/matecconf/202133608003 ◽

2021 ◽

Vol 336 ◽

pp. 08003

Author(s):

Zhijian Qin ◽

Lin Huo ◽

Shicong Zhang

Keyword(s):

Cloud Storage ◽

Bloom Filter ◽

Data Integrity ◽

Index Structure ◽

Effective Protection ◽

Merkle Tree ◽

Cloud Server ◽

Tree Index ◽

Cloud Servers ◽

Verification Model

Data integrity validation is considered to be an important tool to solve the problem that cloud subscribers cannot accurately know whether there are non-subjective changes in the data they upload to cloud servers. In this paper, a data integrity verification model based on dynamic successor tree index structure, Bloom filter and Merkle tree is proposed. The block labels generated according to the features of the dynamic successor tree index structure can sense whether changes have been made to the user's data, while the Merkle tree can track the cha*nged data blocks, enabling the user to effectively verify the integrity of the data stored in the cloud server and provide more effective protection for data.

Download Full-text