Improving Query Response Time for Graph Data Using Materialization

We are moving towards digitization and making all our devices, such as sensors and cameras, connected to internet, producing bigdata. This bigdata has variety of data and has paved the way to the emergence of NoSQL databases, like Cassandra, for achieving scalability and availability. Hadoop framework has been developed for storing and processing distributed data. In this chapter, the authors investigated the storage and retrieval of geospatial data by integrating Hadoop and Cassandra using prefix-based partitioning and Cassandra's default partitioning algorithm (i.e., Murmur3partitioner) techniques. Geohash value is generated, which acts as a partition key and also helps in effective search. Hence, the time taken for retrieving data is optimized. When users request spatial queries like finding nearest locations, searching in Cassandra database starts using both partitioning techniques. A comparison on query response time is made so as to verify which method is more effective. Results show the prefix-based partitioning technique is more efficient than Murmur3 partitioning technique.

Get full-text (via PubEx)

Framework for GeoSpatial Query Processing by Integrating Cassandra With Hadoop

GIS Applications in the Tourism and Hospitality Industry - Advances in Hospitality, Tourism, and the Services Industry ◽

10.4018/978-1-5225-5088-4.ch001 ◽

2018 ◽

pp. 1-41

Author(s):

S. Vasavi ◽

Mallela Padma Priya ◽

Anu A. Gokhale

Keyword(s):

Response Time ◽

Query Processing ◽

Geospatial Data ◽

Distributed Data ◽

Nosql Databases ◽

Storage And Retrieval ◽

Query Response Time ◽

Partitioning Algorithm ◽

Hadoop Framework ◽

Partitioning Technique

We are moving towards digitization and making all our devices, such as sensors and cameras, connected to internet, producing bigdata. This bigdata has variety of data and has paved the way to the emergence of NoSQL databases, like Cassandra, for achieving scalability and availability. Hadoop framework has been developed for storing and processing distributed data. In this chapter, the authors investigated the storage and retrieval of geospatial data by integrating Hadoop and Cassandra using prefix-based partitioning and Cassandra's default partitioning algorithm (i.e., Murmur3partitioner) techniques. Geohash value is generated, which acts as a partition key and also helps in effective search. Hence, the time taken for retrieving data is optimized. When users request spatial queries like finding nearest locations, searching in Cassandra database starts using both partitioning techniques. A comparison on query response time is made so as to verify which method is more effective. Results show the prefix-based partitioning technique is more efficient than Murmur3 partitioning technique.

Get full-text (via PubEx)

A Framework for Predicting Query Response Time

2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems ◽

10.1109/hpcc.2012.167 ◽

2012 ◽

Author(s):

Rekha Singhal

Keyword(s):

Response Time ◽

Query Response Time

Get full-text (via PubEx)

Analytical query response time evaluation for a two-level clustering hierarchy based wireless sensor network routing protocol

IEEE Communications Letters ◽

10.1109/lcomm.2010.05.091473 ◽

2010 ◽

Vol 14 (5) ◽

pp. 486-488 ◽

Cited By ~ 5

Author(s):

Siva Muruganathan ◽

Abu Sesay ◽

Witold Krzymien

Keyword(s):

Wireless Sensor Network ◽

Response Time ◽

Sensor Network ◽

Routing Protocol ◽

Network Routing ◽

Wireless Sensor ◽

Query Response Time ◽

Time Evaluation

Get full-text (via PubEx)

Reducing multidatabase query response time by tree balancing

ACM SIGMOD Record ◽

10.1145/568271.223846 ◽

1995 ◽

Vol 24 (2) ◽

pp. 293-303 ◽

Cited By ~ 5

Author(s):

Weimin Du ◽

Ming-Chien Shan ◽

Umeshwar Dayal

Keyword(s):

Response Time ◽

Query Response Time

Get full-text (via PubEx)

A parallel execution method for minimizing distributed query response time

IEEE Transactions on Parallel and Distributed Systems ◽

10.1109/71.139206 ◽

1992 ◽

Vol 3 (3) ◽

pp. 325-333 ◽

Cited By ~ 13

Author(s):

C. Wang ◽

A.L.P. Chen ◽

S.-C. Shyu

Keyword(s):

Response Time ◽

Parallel Execution ◽

Query Response Time ◽

Distributed Query

Get full-text (via PubEx)

SELECTIVELY MATERIALIZING DATA IN MEDIATORS BY ANALYZING USER QUERIES

International Journal of Cooperative Information Systems ◽

10.1142/s0218843002000534 ◽

2002 ◽

Vol 11 (01n02) ◽

pp. 119-144 ◽

Cited By ~ 6

Author(s):

NAVEEN ASHISH ◽

CRAIG KNOBLOCK ◽

CYRUS SHAHABI

Keyword(s):

Response Time ◽

Data Sources ◽

Multiple Data Sources ◽

Optimization System ◽

Query Response Time ◽

Multiple Data ◽

Building Information ◽

Information Mediators ◽

User Queries

There is currently great interest in building information mediators that can integrate information from multiple data sources such as databases or Web sources. The query response time for such mediators is typically quite high, mainly due to the time spent in retrieving data from remote sources. We present an approach for optimizing the performance of information mediators by selectively materializing data. We first present our overall framework for materialization in a mediator environment. The data is materialized selectively. We outline the factors that are considered in selecting data to materialize. We present an algorithm for identifying classes of data to materialize by analyzing one of the factors which is the distribution of user queries. We present results with an implemented version of our optimization system for the Ariadne information mediator, which show the effectiveness of our algorithm in extracting patterns of frequently accessed classes from user queries. We also demonstrate the effectiveness of approach in optimizing mediator performance by materializing such classes.

Get full-text (via PubEx)

A P2P object-oriented database system that supports multi-attribute and range queries with improved query response time

2010 International Symposium on Information Technology ◽

10.1109/itsim.2010.5561465 ◽

2010 ◽

Cited By ~ 1

Author(s):

Goh Chiao Wei ◽

Lim Tong Ming

Keyword(s):

Response Time ◽

Object Oriented ◽

Database System ◽

Range Queries ◽

Query Response Time ◽

Object Oriented Database

Get full-text (via PubEx)

An Empirical Analysis to Identify the Effect of Indexing on Influence Detection using Graph Databases

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.i1066.0789s19 ◽

2019 ◽

Vol 8 (9S) ◽

pp. 414-421 ◽

Cited By ~ 1

Keyword(s):

Social Media ◽

Response Time ◽

Empirical Analysis ◽

Time Complexity ◽

Data Retrieval ◽

Graph Databases ◽

Query Response Time ◽

Twitter Data ◽

Indexing Techniques ◽

Social Media Platforms

The data generated on social media platforms such as Twitter, Facebook, LinkedIn etc. are highly connected. Such data can be efficiently stored and analyzed using graph databases due to the inherent property of graphs to model connected data. To reduce the time complexity of data retrieval from huge graph databases, various indexing techniques are used. This paper presents an extensive empirical analysis on popular graph databases i.e. Neo4j, ArangoDB and OrientDB; with an aim to measure the competencies and effectiveness of primitive indexing techniques on query response time to identify the influencing entities from Twitter data. The analysis demonstrates that Neo4j performs efficient and stable for load, relation and property queries compare to other two databases whereas the performance of OrientDB can be improved using primitive indexing.

Get full-text (via PubEx)