Framework for GeoSpatial Query Processing by Integrating Cassandra With Hadoop

We are moving towards digitization and connecting all our devices, such as sensors and cameras, to the internet, which produces big data. This big data is highly varied and has paved the way for the emergence of NoSQL databases, such as Cassandra, that provide scalability and availability. The Hadoop framework was developed for storing and processing distributed data. In this chapter, the authors investigate the storage and retrieval of geospatial data by integrating Hadoop and Cassandra, using two partitioning techniques: prefix-based partitioning and Cassandra's default partitioning algorithm (i.e., Murmur3Partitioner). A geohash value is generated, which acts as the partition key and also supports effective search, reducing the time taken to retrieve data. When users issue spatial queries, such as finding the nearest locations, the Cassandra database is searched using both partitioning techniques. Query response times are compared to determine which method is more effective. The results show that the prefix-based partitioning technique is more efficient than the Murmur3 partitioning technique.
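The geohash-as-partition-key idea can be illustrated with a minimal sketch. It uses the standard geohash encoding (interleaved longitude and latitude bits, base-32 alphabet) and assumes the partition key is simply the leading characters of the geohash. The function names and the prefix length of 4 are hypothetical choices for illustration, not details taken from the chapter.

```python
# Standard geohash base-32 alphabet (digits plus letters, without a, i, l, o).
BASE32 = "0123456789bcdefghjkmnpqrstuvwxyz"


def geohash_encode(lat, lon, precision=8):
    """Encode a latitude/longitude pair as a geohash string."""
    lat_lo, lat_hi = -90.0, 90.0
    lon_lo, lon_hi = -180.0, 180.0
    chars = []
    bits = 0
    bit_count = 0
    use_lon = True  # bits alternate, starting with longitude
    while len(chars) < precision:
        if use_lon:
            mid = (lon_lo + lon_hi) / 2
            if lon >= mid:
                bits = (bits << 1) | 1
                lon_lo = mid
            else:
                bits <<= 1
                lon_hi = mid
        else:
            mid = (lat_lo + lat_hi) / 2
            if lat >= mid:
                bits = (bits << 1) | 1
                lat_lo = mid
            else:
                bits <<= 1
                lat_hi = mid
        use_lon = not use_lon
        bit_count += 1
        if bit_count == 5:  # every 5 bits become one base-32 character
            chars.append(BASE32[bits])
            bits = 0
            bit_count = 0
    return "".join(chars)


def partition_key(lat, lon, prefix_len=4):
    """Hypothetical prefix-based partition key: the leading geohash characters."""
    return geohash_encode(lat, lon)[:prefix_len]


# Two nearby points share a geohash prefix, so they map to the same partition.
key_a = partition_key(57.64911, 10.40744)
key_b = partition_key(57.64900, 10.40700)
```

Because nearby locations share a prefix, a nearest-location query can be directed at one partition (plus its neighbours) rather than being scattered across the cluster, which is the intuition behind comparing this scheme with hash-based partitioning.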

Framework for Visualization of GeoSpatial Query Processing by Integrating Redis With Spark

International Journal of Natural Computing Research ◽

10.4018/ijncr.2019070101 ◽

2019 ◽

Vol 8 (3) ◽

pp. 1-25 ◽

Cited By ~ 1

Author(s):

S. Vasavi ◽

V.N. Priyanka G ◽

Anu A. Gokhale

Keyword(s):

Big Data ◽

Geospatial Data ◽

Distributed Data ◽

Nosql Databases ◽

Real Time Processing ◽

Time Processing ◽

Storage And Retrieval ◽

Tourism Marketing ◽

Memory Store ◽

Benchmark Datasets

We are moving towards digitization, and our devices now produce a wide variety of data; this has paved the way for the emergence of NoSQL databases such as Cassandra, MongoDB, and Redis. Big data such as geospatial data enables geospatial analytics in applications such as tourism, marketing, and rural development. The Spark framework provides operators for the storage and processing of distributed data. This article proposes “GeoRediSpark” to integrate Redis with Spark. Redis is a key-value store that keeps data in memory, so integrating Redis with Spark can extend the real-time processing of geospatial data. The article investigates storage and retrieval using Redis's built-in geospatial queries and adds two new geospatial operators, GeoWithin and GeoIntersect, to enhance the capabilities of Redis. Hashed indexing is used to improve processing performance. Redis metrics are compared on three benchmark datasets. A hashset is used to display geographic data. The output of geospatial queries is visualized in Tableau according to the type of place and the nature of the query.
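To make the idea of a "within" operator concrete, here is a minimal sketch of a point-in-polygon filter using the ray-casting technique. It is not the article's GeoWithin implementation, which the abstract does not describe; the function names, the planar (longitude, latitude) treatment of coordinates, and the sample data are all assumptions for illustration.

```python
def point_in_polygon(point, polygon):
    """Ray casting: count how many polygon edges a ray going right from the point crosses."""
    x, y = point
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through the point can be crossed.
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def geo_within(places, polygon):
    """Hypothetical GeoWithin-style filter: names of places that fall inside the polygon."""
    return [name for name, pt in places.items() if point_in_polygon(pt, polygon)]


# Made-up sample data: a square region and three named points.
region = [(0.0, 0.0), (10.0, 0.0), (10.0, 10.0), (0.0, 10.0)]
places = {"museum": (5.0, 5.0), "airport": (15.0, 5.0), "park": (2.0, 8.0)}
matches = geo_within(places, region)
```

Treating coordinates as planar is a simplification that is reasonable for small regions; a production operator would also need to handle points lying exactly on an edge and polygons that cross the antimeridian.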

Secured Geospatial Data Storage and Retrieval Using Spatial Hadoop Framework in Cloud Environment

2017 Second International Conference on Recent Trends and Challenges in Computational Models (ICRTCCM) ◽

10.1109/icrtccm.2017.77 ◽

2017 ◽

Cited By ~ 2

Author(s):

S. Karthi ◽

S. Prabu

Keyword(s):

Data Storage ◽

Geospatial Data ◽

Cloud Environment ◽

Storage And Retrieval ◽

Hadoop Framework

Machine Learning for Query Processing System and Query Response Time using Hadoop

International Journal for Modern Trends in Science and Technology - RTT2020 ◽

10.46501/ijmtstciet15 ◽

2020 ◽

Vol 6 (8S) ◽

pp. 76-81

Author(s):

M. Srikanth ◽

R.N.V. Jagan Mohan

Keyword(s):

Machine Learning ◽

Response Time ◽

Query Processing ◽

Processing System ◽

Query Response Time

An Efficient, Secure, and Queryable Encryption for NoSQL-Based Databases Hosted on Untrusted Cloud Environments

International Journal of Information Security and Privacy ◽

10.4018/ijisp.2019040102 ◽

2019 ◽

Vol 13 (2) ◽

pp. 14-31

Author(s):

Mamdouh Alenezi ◽

Muhammad Usama ◽

Khaled Almustafa ◽

Waheed Iqbal ◽

Muhammad Ali Raza ◽

...

Keyword(s):

Query Processing ◽

State Of The Art ◽

Data Communication ◽

Nosql Databases ◽

High Concern ◽

Cloud Environments ◽

High Scalability ◽

And Performance ◽

Secure Query Processing ◽

Security Concern

NoSQL-based databases are attractive for storing and managing big data, mainly because of their high scalability and data modeling flexibility. However, security in NoSQL-based databases is weak, which raises concerns for users. Specifically, the security of data at rest is a major concern for users who have deployed their NoSQL-based solutions in the cloud, because unauthorized access to the servers easily exposes the data. There have been some efforts to enable encryption of data at rest for NoSQL databases. However, existing solutions do not support secure query processing or data communication over the Internet, and their performance is also poor. In this article, the authors address the security of NoSQL data at rest by introducing a system that can dynamically encrypt and decrypt data, support secure query processing, and integrate seamlessly with any NoSQL-based database. The proposed solution is based on a combination of chaotic encryption and Order Preserving Encryption (OPE). The experimental evaluation showed excellent results when the solution was integrated with MongoDB and compared with state-of-the-art existing work.
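The property that makes OPE useful for query processing is that ciphertexts sort in the same order as plaintexts, so range comparisons can run on encrypted values. The following toy sketch shows only that property; it is not the article's scheme, it is not secure, and the function names and key are hypothetical. It maps a non-negative integer to the running sum of key-derived positive gaps, which is strictly increasing by construction.

```python
import hashlib
import hmac


def _gap(key: bytes, i: int) -> int:
    """Key-derived positive gap for position i (always at least 1)."""
    digest = hmac.new(key, str(i).encode(), hashlib.sha256).digest()
    return 1 + int.from_bytes(digest[:2], "big")


def toy_ope_encrypt(key: bytes, value: int) -> int:
    """Toy order-preserving mapping for small non-negative integers.

    Because every gap is positive, a < b implies encrypt(a) < encrypt(b).
    Cost is linear in the value, so this is for illustration only.
    """
    if value < 0:
        raise ValueError("value must be non-negative")
    return sum(_gap(key, i) for i in range(value + 1))


key = b"demo-key"  # hypothetical key, for illustration only
ages = [42, 7, 19, 30]
encrypted = [toy_ope_encrypt(key, a) for a in ages]

# A range filter evaluated purely on ciphertexts: "age between 10 and 35".
low, high = toy_ope_encrypt(key, 10), toy_ope_encrypt(key, 35)
in_range = [a for a, c in zip(ages, encrypted) if low <= c <= high]
```

The same property is also the main weakness of OPE: revealing order leaks information about the plaintexts, which is one reason to combine it with another cipher for the stored data, as the abstract describes.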

Sliding window top-k dominating query processing over distributed data streams

Distributed and Parallel Databases ◽

10.1007/s10619-015-7187-9 ◽

2015 ◽

Vol 34 (4) ◽

pp. 535-566 ◽

Cited By ~ 6

Author(s):

Daichi Amagata ◽

Takahiro Hara ◽

Shojiro Nishio

Keyword(s):

Query Processing ◽

Data Streams ◽

Sliding Window ◽

Distributed Data ◽

Distributed Data Streams

Improving Computation Power by Reducing Query Response Time in Peer-to-Peer Environment

Journal of Computer Science ◽

10.3844/jcssp.2011.434.439 ◽

2011 ◽

Vol 7 (3) ◽

pp. 434-439

Author(s):

Nandagopal

Keyword(s):

Response Time ◽

Peer To Peer ◽

Query Response Time ◽

Peer Environment

Improving Query Response Time for Graph Data Using Materialization

Journal of Independent Studies and Research - Computing ◽

10.31645/jisrc/(2015).13.2.0004 ◽

2015 ◽

Vol 13 (2) ◽

Author(s):

Abdul Waheed ◽

Dr. Syed Saif ur Rahman

Keyword(s):

Response Time ◽

Graph Data ◽

Query Response Time

WPS ENABLED SDI: AN OPEN SOURCE APPROACH TO PROVIDE GEOPROCESSING IN WEB ENVIRONMENT

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-iv-5-w2-119-2019 ◽

2019 ◽

Vol IV-5/W2 ◽

pp. 119-126

Author(s):

A. K. Tripathi ◽

S. Agrawal ◽

R. D. Gupta

Keyword(s):

Open Source ◽

Spatial Data ◽

Geospatial Data ◽

Distributed Data ◽

Data Infrastructure ◽

Web Environment ◽

Local Resources ◽

Processing Service ◽

Server Architecture ◽

The Web

Abstract. Sharing and managing geospatial data among different communities and users is a challenge that is suitably addressed by Spatial Data Infrastructure (SDI). SDI helps people discover, edit, process, and visualize spatial data. The user can download data from the SDI and process it using local resources. However, the large volume and heterogeneity of the data make this processing difficult at the client end. This problem can be resolved by orchestrating the Web Processing Service (WPS) with SDI. WPS is a service interface through which geoprocessing can be done over the internet. In this paper, a WPS-enabled SDI framework with OGC-compliant services is conceptualized and developed. It is based on a three-tier client-server architecture. OGC services are provided through GeoServer. The WPS extension of GeoServer is used to perform geospatial data processing and analysis. The developed framework is used to create a public health SDI prototype using Open Source Software (OSS). The integration of WPS with SDI demonstrates how the various data analysis operations of WPS can be performed over the web on distributed data sources provided by SDI.
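As a rough illustration of how a client talks to a WPS endpoint, the sketch below builds key-value-pair request URLs for the WPS 1.0.0 `GetCapabilities` and `DescribeProcess` operations. The base URL is a hypothetical local GeoServer instance and the process identifier is an illustrative assumption; neither comes from the paper, and no request is actually sent.

```python
from urllib.parse import urlencode


def wps_url(base_url, request, **params):
    """Build a WPS 1.0.0 key-value-pair request URL."""
    query = {"service": "WPS", "version": "1.0.0", "request": request}
    query.update(params)
    return f"{base_url}?{urlencode(query)}"


# Hypothetical endpoint of a local GeoServer with the WPS extension installed.
BASE = "http://localhost:8080/geoserver/ows"

# Ask the server which processes it offers.
capabilities_url = wps_url(BASE, "GetCapabilities")

# Ask for the inputs and outputs of one process (identifier is an assumption).
describe_url = wps_url(BASE, "DescribeProcess", identifier="JTS:buffer")
```

Running a process uses the `Execute` operation, which is usually sent as an XML document via HTTP POST; that step is omitted here because its payload depends on the specific process.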

DocBase

Innovations in Database Design, Web Applications, and Information Systems Management ◽

10.4018/978-1-4666-2044-5.ch014 ◽

2013 ◽

pp. 365-393

Author(s):

Arijit Sengupta ◽

Ramesh Venkataraman

Keyword(s):

Query Processing ◽

Query Language ◽

Formal Model ◽

Query Languages ◽

Prototype System ◽

Storage And Retrieval ◽

Visual Query Formulation ◽

Visual Query ◽

Nested Relations ◽

Entity Relationship

This chapter introduces a complete storage and retrieval architecture for a database environment for XML documents. DocBase, a prototype system based on this architecture, uses a flexible storage and indexing technique to allow highly expressive queries without the need to map documents to other database formats. DocBase integrates several techniques: (i) a formal model called Heterogeneous Nested Relations (HNR), (ii) a conceptual model, XER (Extensible Entity Relationship), (iii) formal query languages (Document Algebra and Calculus), (iv) a practical query language (Document SQL or DSQL), (v) a visual query formulation method with QBT (Query By Templates), and (vi) the DocBase query processing architecture. The chapter focuses on the overall architecture of DocBase, including implementation details, describes the query-processing framework in detail, and presents results from various performance tests. It summarizes experimental and usability analyses to demonstrate the feasibility of DocBase as a general architecture for native as well as embedded document manipulation methods.
