tree index
Recently Published Documents


TOTAL DOCUMENTS

128
(FIVE YEARS 30)

H-INDEX

6
(FIVE YEARS 2)

2021 ◽  
Vol 10 (11) ◽  
pp. 727
Author(s):  
Jieqing Yu ◽  
Yi Wei ◽  
Qi Chu ◽  
Lixin Wu

Support for region queries is crucial in geographic information systems, which process exact queries through spatial indexing to filter features and subsequently refine the selection. Although the filtering step has been extensively studied, the refinement step has received little attention. This research builds upon the QR−tree index, which decomposes space into hierarchical grids, registers features to the grids, and builds an R−tree for each grid, to develop a new QRB−tree index with two levels of optimization. In the first level, a bucket is introduced in every grid in the QR−tree index to accelerate the loading and search steps of a query region for the grids within the query region. In the second level, the number of candidate features to be eliminated is reduced by limiting the features to those registered to the grids covering the corners of the query region. Subsequently, an approach for determining the maximal grid level, which significantly affects the performance of the QR−tree index, is proposed. Direct comparisons of time costs with the QR−tree index and geohash index show that the QRB−tree index outperforms the other two approaches for rough queries in large query regions and exact queries in all cases.


Author(s):  
Anirban Mondal ◽  
Ayaan Kakkar ◽  
Nilesh Padhariya ◽  
Mukesh Mohania

AbstractNext-generation enterprise management systems are beginning to be developed based on the Systems of Engagement (SOE) model. We visualize an SOE as a set of entities. Each entity is modeled by a single parent document with dynamic embedded links (i.e., child documents) that contain multi-modal information about the entity from various networks. Since entities in an SOE are generally queried using keywords, our goal is to efficiently retrieve the top-k entities related to a given keyword-based query by considering the relevance scores of both their parent and child documents. Furthermore, we extend the afore-mentioned problem to incorporate the case where the entities are geo-tagged. The main contributions of this work are three-fold. First, it proposes an efficient bitmap-based approach for quickly identifying the candidate set of entities, whose parent documents contain all queried keywords. A variant of this approach is also proposed to reduce memory consumption by exploiting skews in keyword popularity. Second, it proposes the two-tier HI-tree index, which uses both hashing and inverted indexes, for efficient document relevance score lookups. Third, it proposes an R-tree-based approach to extend the afore-mentioned approaches for the case where the entities are geo-tagged. Fourth, it performs comprehensive experiments with both real and synthetic datasets to demonstrate that our proposed schemes are indeed effective in providing good top-k result recall performance within acceptable query response times.


2021 ◽  
Author(s):  
Stiw Herrera ◽  
Larissa Miguez da Silva ◽  
Paulo Ricardo Reis ◽  
Anderson Silva ◽  
Fabio Porto

Scientific data is mainly multidimensional in its nature, presenting interesting opportunities for optimizations when managed by array databases. However, in scenarios where data is sparse, an efficient implementation is still required. In this paper, we investigate the adoption of the Ph-tree as an in-memory indexing structure for sparse data. We compare the performance in data ingestion and in both range and punctual queries, using SAVIME as the multidimensional array DBMS. Our experiments, using a real weather dataset, highlights the challenges involving providing a fast data ingestion, as proposed by SAVIME, and at the same time efficiently answering multidimensional queries on sparse data.


Author(s):  
Yinglian Zhou ◽  
Jifeng Chen

Driven by experience and social impact of the new life, user preferences continue to change over time. In order to make up for the shortcomings of existing geographic social network models that often cannot obtain user dynamic preferences, a time-series geographic social network model was constructed to detect user dynamic preferences, a dynamic preference value model was built for user dynamic preference evaluation, and a dynamic preferences group query (DPG) was proposed in this paper . In order to optimize the efficiency of the DPG query algorithm, the UTC-tree index user timing check-in record is designed. UTC-tree avoids traversing all user check-in records in the query, accelerating user dynamic preference evaluation. Finally, the DPG query algorithm is used to implement a well-interacted DPG query system. Through a large number of comparative experiments, the validity of UTC-tree and the scalability of DPG query are verified.


2021 ◽  
pp. 101913
Author(s):  
Gang Wu ◽  
Yidong Song ◽  
Guodong Zhao ◽  
Wei Sun ◽  
Donghong Han ◽  
...  
Keyword(s):  

2021 ◽  
Vol 50 (1) ◽  
pp. 41-41
Author(s):  
Qin Zhang

One of the most important functionalities of a database system is to answer queries. We are interested in the following question: If there exists more than one answer to the given query, which one should the database report? There are two apparent choices: to return all the valid answers or to return one of them. The problem with the former choice is that it is often time-prohibitive to search for all valid answers. In the latter choice, fairness may become an issue, since the index built for fast search may introduce bias to the query result. For example, the index may favor a certain portion of the input data (e.g., nodes near the root of a tree index) and with a higher chance, output an answer related to that portion than other portions. Such bias can sometimes lead to undesirable consequences.


2021 ◽  
Vol 14 (8) ◽  
pp. 1276-1288
Author(s):  
Jiacheng Wu ◽  
Yong Zhang ◽  
Shimin Chen ◽  
Jin Wang ◽  
Yu Chen ◽  
...  

Index plays an essential role in modern database engines to accelerate the query processing. The new paradigm of "learned index" has significantly changed the way of designing index structures in DBMS. The key insight is that indexes could be regarded as learned models that predict the position of a lookup key in the dataset. While such studies show promising results in both lookup time and index size, they cannot efficiently support update operations. Although recent studies have proposed some preliminary approaches to support update, they are at the cost of scarifying the lookup performance as they suffer from the overheads brought by imprecise predictions in the leaf nodes. In this paper, we propose LIPP, a brand new framework of learned index to address such issues. Similar with state-of-the-art learned index structures, LIPP is able to support all kinds of index operations, namely lookup query, range query, insert, delete, update and bulkload. Meanwhile, we overcome the limitations of previous studies by properly extending the tree structure when dealing with update operations so as to eliminate the deviation of location predicted by the models in the leaf nodes. Moreover, we further propose a dynamic adjustment strategy to ensure that the height of the tree index is tightly bounded and provide comprehensive theoretical analysis to illustrate it. We conduct an extensive set of experiments on several real-life and synthetic datasets. The results demonstrate that our method consistently outperforms state-of-the-art solutions, achieving by up to 4X for a broader class of workloads with different index operations.


2021 ◽  
Vol 336 ◽  
pp. 08003
Author(s):  
Zhijian Qin ◽  
Lin Huo ◽  
Shicong Zhang

Data integrity validation is considered to be an important tool to solve the problem that cloud subscribers cannot accurately know whether there are non-subjective changes in the data they upload to cloud servers. In this paper, a data integrity verification model based on dynamic successor tree index structure, Bloom filter and Merkle tree is proposed. The block labels generated according to the features of the dynamic successor tree index structure can sense whether changes have been made to the user's data, while the Merkle tree can track the cha*nged data blocks, enabling the user to effectively verify the integrity of the data stored in the cloud server and provide more effective protection for data.


Sign in / Sign up

Export Citation Format

Share Document