query efficiency Latest Research Papers

A blockchain index structure based on subchain query

Blockchain technology is decentralized and tamper-resistant, so it can store data safely and effectively reduce the cost of trust. However, existing blockchain systems perform poorly at data management and support only traversal queries keyed on transaction hashes. The query method based on the account transaction trace chain (ATTC) improves the efficiency of querying an account's historical transactions, but it does not effectively improve queries on accounts with longer transaction chains. To address the inefficiency and the single query method of the ATTC index, we propose a subchain-based account transaction chain (SCATC) index structure. First, the account transaction chain is divided into subchains, and the last block of each subchain is connected by a hash pointer. The block-by-block query mode of ATTC is thereby converted to a subchain-by-subchain query mode, which shortens the query path. Multiple transactions of the same account in the same block are merged and stored, which reduces the construction cost of the index and saves storage resources. Then, the construction and query algorithms for the SCATC index structure are given. Simulation analysis shows that the SCATC index structure significantly improves query efficiency.
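The subchain idea in this abstract can be illustrated with a minimal Python sketch. It is not the paper's construction or query algorithm: the `Entry` layout, the `build_index` and `query` functions, the fixed `subchain_len`, and the block-height range query are all assumptions made for illustration. Each entry merges one account's transactions in one block, links to the previous entry, and, if it closes a subchain, also links to the last entry of the previous subchain.

```python
import hashlib
from dataclasses import dataclass
from itertools import groupby
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Entry:
    height: int                # block height
    tx_ids: Tuple[str, ...]    # the account's transactions in this block, merged
    prev: Optional[str]        # hash pointer to the previous entry
    prev_tail: Optional[str]   # only on a subchain's last entry: pointer to
                               # the last entry of the previous subchain

    def digest(self) -> str:
        raw = f"{self.height}|{','.join(self.tx_ids)}|{self.prev}|{self.prev_tail}"
        return hashlib.sha256(raw.encode()).hexdigest()


def build_index(txs: List[Tuple[int, str]],
                subchain_len: int) -> Tuple[Dict[str, Entry], Optional[str]]:
    """Index (height, tx_id) pairs of ONE account, sorted by height.

    Returns the entry store (hash -> entry) and the hash of the newest entry.
    """
    store: Dict[str, Entry] = {}
    prev: Optional[str] = None
    last_tail: Optional[str] = None
    grouped = groupby(txs, key=lambda t: t[0])
    for i, (height, group) in enumerate(grouped, start=1):
        is_tail = i % subchain_len == 0
        entry = Entry(height, tuple(tx for _, tx in group), prev,
                      last_tail if is_tail else None)
        prev = entry.digest()
        store[prev] = entry
        if is_tail:
            last_tail = prev
    return store, prev


def query(store: Dict[str, Entry], head: Optional[str],
          lo: int, hi: int) -> Tuple[List[str], int]:
    """Return the tx ids in blocks lo..hi (newest first) and entries visited."""
    found: List[str] = []
    visited = 0
    cur = head
    while cur is not None:
        entry = store[cur]
        visited += 1
        if entry.height < lo:
            break
        if entry.height <= hi:
            found.extend(entry.tx_ids)
        # If the previous subchain's tail is still above the range, every
        # entry between it and this one is too, so jump tail to tail.
        if entry.prev_tail is not None and store[entry.prev_tail].height > hi:
            cur = entry.prev_tail
        else:
            cur = entry.prev
    return found, visited


if __name__ == "__main__":
    txs = [(h, f"t{h}") for h in range(1, 21)]
    txs.insert(3, (3, "t3b"))              # a second transaction in block 3
    store, head = build_index(txs, subchain_len=5)
    print(query(store, head, lo=2, hi=3))
```

In this toy setup, with 20 indexed blocks and subchains of 5 entries, a query for blocks 2 to 3 visits 8 entries; with a `subchain_len` larger than the chain (no tail pointers, i.e. block-by-block traversal) the same query visits all 20.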

PoBery: Possibly-complete Big Data Queries with Probabilistic Data Placement and Scanning

ACM/IMS Transactions on Data Science, 2021, Vol 2 (3), pp. 1-28
DOI: 10.1145/3465375

Author(s): Jie Song, Qiang He, Feifei Chen, Ye Yuan, Ge Yu

Keyword(s): Big Data, Query Processing, State Of The Art, Data Placement, Probabilistic Data, Trade Off, Query Performance, Data Query, Query Efficiency, The Given

In big data query processing, there is a trade-off between query accuracy and query efficiency; for example, sampling query approaches trade query completeness for efficiency. In this article, we argue that query performance can be significantly improved by slightly lowering the possibility of query completeness, that is, the chance that a query is complete. To quantify this possibility, we define a new concept, Probability of query Completeness (hereinafter referred to as PC). For example, if a query is executed 100 times, PC = 0.95 guarantees that there are no more than 5 incomplete results among the 100 results. Leveraging probabilistic data placement and scanning, we trade off PC for query performance. We propose PoBery (POssibly-complete Big data quERY), a method that supports neither complete queries nor incomplete queries, but possibly-complete queries. Experimental results on HiBench show that PoBery can significantly accelerate queries while ensuring the PC. Specifically, it is guaranteed that the percentage of complete queries is larger than the given PC confidence. Through comparison with state-of-the-art key-value stores, we show that while Drill-based PoBery performs as fast as Drill on complete queries, it is 1.7×, 1.1×, and 1.5× faster on average than Drill, Impala, and Hive, respectively, on possibly-complete queries.
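The PC arithmetic in the abstract's example can be checked with a short Python sketch. The function names are hypothetical, and the code only illustrates the counting interpretation stated above; it says nothing about PoBery's actual data placement or scanning. PC is passed as a string so that `Fraction` represents it exactly and no floating-point rounding affects the count.

```python
import math
from fractions import Fraction
from typing import Sequence


def max_incomplete(runs: int, pc: str) -> int:
    """Largest number of incomplete results allowed among `runs` executions."""
    required_complete = math.ceil(Fraction(pc) * runs)
    return runs - required_complete


def meets_pc(outcomes: Sequence[bool], pc: str) -> bool:
    """outcomes[i] is True when the i-th execution returned a complete result."""
    required_complete = math.ceil(Fraction(pc) * len(outcomes))
    return sum(outcomes) >= required_complete


if __name__ == "__main__":
    print(max_incomplete(100, "0.95"))                    # 5, as in the example
    print(meets_pc([True] * 94 + [False] * 6, "0.95"))    # False: 6 incomplete
```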

Time-aware collective spatial keyword query

Computer Science and Information Systems, 2021, pp. 34-34
DOI: 10.2298/csis200131034c

Author(s): Zijun Chen, Tingting Zhao, Wenyuan Liu

Keyword(s): Real Life, Research Topic, Temporal Information, Experimental Results, Keyword Query, Evaluation Functions, Spatial Keyword Query, Query Efficiency, Time Aware

The collective spatial keyword query, which considers both positional relevance to the query location and textual relevance to the query keywords, has been a hot research topic in the database community in recent years. However, in real life, the temporal information of an object is not always valid. Based on this, we define a new query, the time-aware collective spatial keyword query (TCoSKQ), which simultaneously considers the positional, textual, and temporal relevance between objects and the query. Two evaluation functions are defined to meet different user needs, and we propose an algorithm for each. Effective pruning strategies are proposed to improve the query efficiency of the two algorithms. Finally, the experimental results show that the proposed algorithms are efficient and scalable.

Self-Attention and Adversary Guided Hashing Network for Cross-Modal Retrieval

2020
DOI: 10.20944/preprints202009.0416.v1

Author(s): Shubai Chen, Li Wang, Song Wu

Keyword(s): Semantic Information, State Of The Art, Local Minima, Adversarial Learning, High Ranking, Benchmark Datasets, Semantic Relevance, Triplet Loss, Query Efficiency, Hash Codes

Recently, deep cross-modal hashing networks have received increasing interest due to their superior query efficiency and low storage cost. However, most existing methods pay little attention to the hash representation learning part, which means the semantic information of the data cannot be fully used. Furthermore, they may neglect the high-ranking relevance and consistency of hash codes. To solve these problems, we propose a Self-Attention and Adversary Guided Hashing Network (SAAGHN). Specifically, it employs a self-attention mechanism in the hash representation learning part to extract rich semantic relevance information. Meanwhile, to keep the hash codes invariant, adversarial learning is adopted in the hash code learning part. In addition, to generate higher-ranking hash codes and avoid falling into local minima early, a new batch semi-hard cosine triplet loss and a cosine quantization loss are proposed. Extensive experiments on two benchmark datasets show that SAAGHN outperforms other baselines and achieves state-of-the-art performance.
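The query efficiency and low storage cost attributed to hash codes come from the retrieval step, which a generic Python sketch can illustrate. This is not SAAGHN or any part of its training: the 8-bit codes, the item names, and the `rank` function are made up for illustration, and the codes are assumed to have been produced already by some hashing model. Each item is stored as a short bit string packed into an integer, and a query is answered by ranking items by Hamming distance (XOR followed by a bit count).

```python
from typing import Dict, List


def hamming(a: int, b: int) -> int:
    """Number of bit positions in which two binary codes differ."""
    return bin(a ^ b).count("1")


def rank(query_code: int, db_codes: Dict[str, int], top_k: int) -> List[str]:
    """Return the names of the top_k items closest to the query in Hamming distance.

    Ties are broken by item name so the result is deterministic.
    """
    ordered = sorted(db_codes,
                     key=lambda name: (hamming(query_code, db_codes[name]), name))
    return ordered[:top_k]


if __name__ == "__main__":
    # Hypothetical 8-bit codes for items of one modality (e.g. images),
    # queried with the code of an item from another modality (e.g. text).
    db = {"img_a": 0b10110010, "img_b": 0b10110011, "img_c": 0b01001101}
    print(rank(0b10110110, db, top_k=2))
```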

Temporal RDF(S) Data Storage and Query with HBase

Journal of Computing and Information Technology, 2020, Vol 27 (4), pp. 17-30
DOI: 10.20532/cit.2019.1004801

Author(s): Li Yan, Zheqing Zhang, Dan Yang

Keyword(s): Information Management, Data Storage, Temporal Information, Storage Model, Web Resources, Metadata Model, Query Efficiency, Rdf Data, Description Framework, Resource Description

Resource Description Framework (RDF) is a metadata model recommended by the World Wide Web Consortium (W3C) for describing Web resources. With the arrival of the Big Data era, very large amounts of RDF data are continuously being created and need to be stored for management. Traditional centralized RDF storage models cannot meet the needs of large-scale RDF data storage. Meanwhile, the importance of temporal information management and processing has been acknowledged by academia and industry. In this paper, we propose a storage model for temporal RDF based on HBase. The proposed storage model applies the built-in time mechanism of HBase. Our experiments on the LUBM dataset with added temporal information show that our storage model can store large temporal RDF data and achieve good query efficiency.

Researching Why-Not Questions in Skyline Query Based on Orthogonal Range

Electronics, 2020, Vol 9 (3), pp. 500
DOI: 10.3390/electronics9030500
Cited By ~ 1

Author(s): Ping Sun, Caimei Liang, Guohui Li, Ling Yuan

Keyword(s): Experimental Results, Skyline Query, High Quality, Query Refinement, Skyline Queries, The Real, Query Efficiency, Synthetic Datasets

This paper aims to answer “why-not” questions in skyline queries based on the orthogonal query range (i.e., ORSQ). These queries retrieve skyline points within a rectangular query range, which improves query efficiency. Answering why-not questions in ORSQ can help users analyze query results and make decisions. We discuss the causes of why-not questions in ORSQ. Then, we outline how to modify the why-not point and the orthogonal query range so that the why-not point is included in the result of the skyline query based on the orthogonal range. When the why-not point is in the orthogonal range, we show how to modify the why-not point and narrow the orthogonal range. We also present how to expand the orthogonal range when the why-not point is not in the orthogonal range. We effectively combine query refinement and data modification techniques to produce meaningful answers. The experimental results demonstrate that the proposed algorithms produce high-quality explanations for why-not questions in ORSQ on both real and synthetic datasets.
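A skyline query restricted to an orthogonal (rectangular) range, and the notion of a why-not point, can be illustrated with a naive Python sketch. It is not the paper's algorithm: the function names, the sample points, and the convention that smaller values are better in every dimension are assumptions for illustration, and the quadratic-time dominance check is chosen for clarity only.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(p: Point, q: Point) -> bool:
    """p dominates q: no worse in every dimension, strictly better in one.

    Assumes smaller values are better.
    """
    return (all(a <= b for a, b in zip(p, q))
            and any(a < b for a, b in zip(p, q)))


def range_skyline(points: Sequence[Point], low: Point, high: Point) -> List[Point]:
    """Skyline of the points that lie inside the rectangle [low, high]."""
    in_range = [p for p in points
                if all(l <= v <= h for v, l, h in zip(p, low, high))]
    return [p for p in in_range
            if not any(dominates(q, p) for q in in_range)]


if __name__ == "__main__":
    pts = [(1, 9), (2, 4), (3, 3), (4, 5), (6, 1), (7, 7)]
    # (4, 5) lies inside the range but is missing from the result because
    # (2, 4) and (3, 3) dominate it: it is a "why-not" point.
    print(range_skyline(pts, low=(2, 2), high=(7, 8)))
    # Narrowing the range so the dominating points fall outside brings it in.
    print(range_skyline(pts, low=(4, 2), high=(7, 8)))
```

In this example, narrowing the range is enough to include the why-not point; the abstract also describes modifying the why-not point itself and expanding the range when the point lies outside it, which this sketch does not cover.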

A Method to Improve the Fresh Data Query Efficiency of Blockchain

2020 12th International Conference on Measuring Technology and Mechatronics Automation (ICMTMA), 2020
DOI: 10.1109/icmtma50254.2020.00179

Author(s): Xinhua Liu, Xirui Yu, Xiaolin Ma, Hailan Kuang

Keyword(s): Data Query, Query Efficiency

Improving Query Efficiency of Black-Box Adversarial Attack

Computer Vision – ECCV 2020 - Lecture Notes in Computer Science, 2020, pp. 101-116
DOI: 10.1007/978-3-030-58595-2_7

Author(s): Yang Bai, Yuyuan Zeng, Yong Jiang, Yisen Wang, Shu-Tao Xia, ...

Keyword(s): Black Box, Adversarial Attack, Query Efficiency

Strark-H: A Strategy for Spatial Data Storage to Improve Query Efficiency Based on Spark

Algorithms and Architectures for Parallel Processing - Lecture Notes in Computer Science, 2020, pp. 285-299
DOI: 10.1007/978-3-030-38991-8_19

Author(s): Weitao Zou, Weipeng Jing, Guangsheng Chen, Yang Lu

Keyword(s): Data Storage, Spatial Data, Query Efficiency

Coupled CycleGAN: Unsupervised Hashing Network for Cross-Modal Retrieval

Proceedings of the AAAI Conference on Artificial Intelligence, 2019, Vol 33, pp. 176-183
DOI: 10.1609/aaai.v33i01.3301176
Cited By ~ 11

Author(s): Chao Li, Cheng Deng, Lei Wang, De Xie, Xianglong Liu

Keyword(s): Large Scale, State Of The Art, The State, Storage Cost, Common Representation, Benchmark Datasets, Query Efficiency, Hash Codes

In recent years, hashing has attracted more and more attention owing to its low storage cost and high query efficiency in large-scale cross-modal retrieval. Benefiting from deep learning, increasingly compelling results have been achieved in the cross-modal retrieval community. However, existing deep cross-modal hashing methods either rely on large amounts of labeled information or are unable to learn an accurate correlation between different modalities. In this paper, we propose Unsupervised coupled Cycle generative adversarial Hashing networks (UCH) for cross-modal retrieval, where an outer-cycle network is used to learn a powerful common representation and an inner-cycle network is used to generate reliable hash codes. Specifically, the proposed UCH seamlessly couples these two networks with a generative adversarial mechanism, so that they can be optimized simultaneously to learn the representation and the hash codes. Extensive experiments on three popular benchmark datasets show that the proposed UCH outperforms state-of-the-art unsupervised cross-modal hashing methods.

query efficiency
Recently Published Documents

A blockchain index structure based on subchain query

PoBery: Possibly-complete Big Data Queries with Probabilistic Data Placement and Scanning

Time-aware collective spatial keyword query

Self-Attention and Adversary Guided Hashing Network for Cross-Modal Retrieval

Temporal RDF(S) Data Storage and Query with HBase

Researching Why-Not Questions in Skyline Query Based on Orthogonal Range

A Method to Improve the Fresh Data Query Efficiency of Blockchain

Improving Query Efficiency of Black-Box Adversarial Attack

Strark-H: A Strategy for Spatial Data Storage to Improve Query Efficiency Based on Spark

Coupled CycleGAN: Unsupervised Hashing Network for Cross-Modal Retrieval
