Distribution of Database in Cloud Based on Associated Matrix

In cloud environments, hardware configurations, data usage, and workload allocations are continuously changing. These changes make it difficult for the query optimizer of a cloud database management system (DBMS) to select an optimal query execution plan (QEP). In order to optimize a query with a more accurate cost estimation, performing query re-optimizations during the query execution has been proposed in the literature. However, some of there-optimizations may not provide any performance gain in terms of query response time or monetary costs, which are the two optimization objectives for cloud databases, and may also have negative impacts on the performance due to their overheads. This raises the question of how to determine when are-optimization is beneficial. In this paper, we present a technique called ReOptML that uses machine learning to enable effective re-optimizations. This technique executes a query in stages, employs a machine learning model to predict whether a query re-optimization is beneficial after a stage is executed, and invokes the query optimizer to perform the re-optimization automatically. The experiments comparing ReOptML with existing query re-optimization algorithms show that ReOptML improves query response time from 13% to 35% for skew data and from 13% to 21% for uniform data, and improves monetary cost paid to cloud service providers from 17% to 35% on skewdata.

Download Full-text

IIoT Based Hierarchical Data Distribution Strategy over Edge and End Device

2020 IEEE 6th International Conference on Computer and Communications (ICCC) ◽

10.1109/iccc51575.2020.9345250 ◽

2020 ◽

Author(s):

Haoyu Yu ◽

Dong Yu ◽

Yi Hu ◽

Chuting Wang

Keyword(s):

Data Distribution ◽

Hierarchical Data ◽

Distribution Strategy

Download Full-text

Improving Computation Power by Reducing Query Response Time in Peer-to-Peer Environment

Journal of Computer Science ◽

10.3844/jcssp.2011.434.439 ◽

2011 ◽

Vol 7 (3) ◽

pp. 434-439

Author(s):

Nandagopal

Keyword(s):

Response Time ◽

Peer To Peer ◽

Query Response Time ◽

Peer Environment

Download Full-text

Improving Query Response Time for Graph Data Using Materialization

Journal of Independent Studies and Research - Computing ◽

10.31645/jisrc/(2015).13.2.0004 ◽

2015 ◽

Vol 13 (2) ◽

Author(s):

Abdul Waheed ◽

◽

Dr. Syed Saif ur Rahman

Keyword(s):

Response Time ◽

Graph Data ◽

Query Response Time

Download Full-text

Data Mining of the Association Rules Based on the Cloud Database

Proceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013) ◽

10.2991/iccsee.2013.387 ◽

2013 ◽

Author(s):

Tianxiang Zhu ◽

Shuhui Sun ◽

Dan Zhang ◽

Xin Liu

Keyword(s):

Data Mining ◽

Association Rules ◽

Cloud Database

Download Full-text

Framework for GeoSpatial Query Processing by Integrating Cassandra With Hadoop

Geospatial Intelligence ◽

10.4018/978-1-5225-8054-6.ch017 ◽

2019 ◽

pp. 353-388

Author(s):

S. Vasavi ◽

Mallela Padma Priya ◽

Anu A. Gokhale

Keyword(s):

Response Time ◽

Query Processing ◽

Geospatial Data ◽

Distributed Data ◽

Nosql Databases ◽

Storage And Retrieval ◽

Query Response Time ◽

Partitioning Algorithm ◽

Hadoop Framework ◽

Partitioning Technique

We are moving towards digitization and making all our devices, such as sensors and cameras, connected to internet, producing bigdata. This bigdata has variety of data and has paved the way to the emergence of NoSQL databases, like Cassandra, for achieving scalability and availability. Hadoop framework has been developed for storing and processing distributed data. In this chapter, the authors investigated the storage and retrieval of geospatial data by integrating Hadoop and Cassandra using prefix-based partitioning and Cassandra's default partitioning algorithm (i.e., Murmur3partitioner) techniques. Geohash value is generated, which acts as a partition key and also helps in effective search. Hence, the time taken for retrieving data is optimized. When users request spatial queries like finding nearest locations, searching in Cassandra database starts using both partitioning techniques. A comparison on query response time is made so as to verify which method is more effective. Results show the prefix-based partitioning technique is more efficient than Murmur3 partitioning technique.

Download Full-text

Framework for GeoSpatial Query Processing by Integrating Cassandra With Hadoop

GIS Applications in the Tourism and Hospitality Industry - Advances in Hospitality, Tourism, and the Services Industry ◽

10.4018/978-1-5225-5088-4.ch001 ◽

2018 ◽

pp. 1-41

Author(s):

S. Vasavi ◽

Mallela Padma Priya ◽

Anu A. Gokhale

Keyword(s):

Response Time ◽

Query Processing ◽

Geospatial Data ◽

Distributed Data ◽

Nosql Databases ◽

Storage And Retrieval ◽

Query Response Time ◽

Partitioning Algorithm ◽

Hadoop Framework ◽

Partitioning Technique

We are moving towards digitization and making all our devices, such as sensors and cameras, connected to internet, producing bigdata. This bigdata has variety of data and has paved the way to the emergence of NoSQL databases, like Cassandra, for achieving scalability and availability. Hadoop framework has been developed for storing and processing distributed data. In this chapter, the authors investigated the storage and retrieval of geospatial data by integrating Hadoop and Cassandra using prefix-based partitioning and Cassandra's default partitioning algorithm (i.e., Murmur3partitioner) techniques. Geohash value is generated, which acts as a partition key and also helps in effective search. Hence, the time taken for retrieving data is optimized. When users request spatial queries like finding nearest locations, searching in Cassandra database starts using both partitioning techniques. A comparison on query response time is made so as to verify which method is more effective. Results show the prefix-based partitioning technique is more efficient than Murmur3 partitioning technique.

Download Full-text