Comparative study of NoSQL databases for big data storage

Gourav Bathla; Rinkle Rani; Himanshu Aggarwal

doi:10.14419/ijet.v7i2.6.10072

Comparative study of NoSQL databases for big data storage

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i2.6.10072 ◽

2018 ◽

Vol 7 (2.6) ◽

pp. 83

Author(s):

Gourav Bathla ◽

Rinkle Rani ◽

Himanshu Aggarwal

Keyword(s):

Big Data ◽

Data Storage ◽

Relational Databases ◽

Large Scale ◽

Technology Selection ◽

Business Organizations ◽

Nosql Databases ◽

Real Solutions ◽

Adequate Knowledge ◽

Long Time

Big data is a collection of large scale of structured, semi-structured and unstructured data. It is generated due to Social networks, Business organizations, interaction and views of social connected users. It is used for important decision making in business and research organizations. Storage which is efficient to process this large scale of data to extract important information in less response time is the need of current competitive time. Relational databases which have ruled the storage technology for such a long time seems not suitable for mixed types of data. Data can not be represented just in the form of rows and columns in tables. NoSQL (Not only SQL) is complementary to SQL technology which can provide various formats for storage that can be easily compatible with high velocity,large volume and different variety of data. NoSQL databases are categorized in four techniques- Column oriented, Key Value based, Graph based and Document oriented databases. There are approximately 120 real solutions existing for these categories; most commonly used solutions are elaborated in Introduction section. Several research works have been carried out to analyze these NoSQL technology solutions. These studies have not mentioned the situations in which a particular data storage technique is to be chosen. In this study and analysis, we have tried our best to provide answer on technology selection based on specific requirement to the reader. In previous research, comparisons amongNoSQL data storage techniques have been described by using real examples like MongoDB, Neo4J etc. Our observation is that if users have adequate knowledge of NoSQL categories and their comparison, then it is easy for them to choose best suitable category and then real solutions can be selected from this category.

Download Full-text

An adaptive spark-based framework for querying large-scale NoSQL and relational databases

PLoS ONE ◽

10.1371/journal.pone.0255562 ◽

2021 ◽

Vol 16 (8) ◽

pp. e0255562

Author(s):

Eman Khashan ◽

Ali Eldesouky ◽

Sally Elghamrawy

Keyword(s):

Big Data ◽

Data Storage ◽

Relational Databases ◽

Large Scale ◽

Query Languages ◽

Heterogeneous Data ◽

Query Execution ◽

Database Queries ◽

Nosql Databases ◽

Complex Queries

The growing popularity of big data analysis and cloud computing has created new big data management standards. Sometimes, programmers may interact with a number of heterogeneous data stores depending on the information they are responsible for: SQL and NoSQL data stores. Interacting with heterogeneous data models via numerous APIs and query languages imposes challenging tasks on multi-data processing developers. Indeed, complex queries concerning homogenous data structures cannot currently be performed in a declarative manner when found in single data storage applications and therefore require additional development efforts. Many models were presented in order to address complex queries Via multistore applications. Some of these models implemented a complex unified and fast model, while others’ efficiency is not good enough to solve this type of complex database queries. This paper provides an automated, fast and easy unified architecture to solve simple and complex SQL and NoSQL queries over heterogeneous data stores (CQNS). This proposed framework can be used in cloud environments or for any big data application to automatically help developers to manage basic and complicated database queries. CQNS consists of three layers: matching selector layer, processing layer, and query execution layer. The matching selector layer is the heart of this architecture in which five of the user queries are examined if they are matched with another five queries stored in a single engine stored in the architecture library. This is achieved through a proposed algorithm that directs the query to the right SQL or NoSQL database engine. Furthermore, CQNS deal with many NoSQL Databases like MongoDB, Cassandra, Riak, CouchDB, and NOE4J databases. This paper presents a spark framework that can handle both SQL and NoSQL Databases. Four scenarios’ benchmarks datasets are used to evaluate the proposed CQNS for querying different NoSQL Databases in terms of optimization process performance and query execution time. The results show that, the CQNS achieves best latency and throughput in less time among the compared systems.

Download Full-text

Modeling and Indexing Spatiotemporal Trajectory Data in Non-Relational Databases

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Managing Big Data in Cloud Computing Environments ◽

10.4018/978-1-4666-9834-5.ch006 ◽

2016 ◽

pp. 133-162 ◽

Cited By ~ 7

Author(s):

Berkay Aydin ◽

Vijay Akkineni ◽

Rafal A Angryk

Keyword(s):

Data Storage ◽

Relational Databases ◽

Large Scale ◽

Database Systems ◽

Distributed Database ◽

Data Models ◽

Trajectory Data ◽

Nosql Databases ◽

Distributed Database Systems ◽

Partitioned Data

With the ever-growing nature of spatiotemporal data, it is inevitable to use non-relational and distributed database systems for storing massive spatiotemporal datasets. In this chapter, the important aspects of non-relational (NoSQL) databases for storing large-scale spatiotemporal trajectory data are investigated. Mainly, two data storage schemata are proposed for storing trajectories, which are called traditional and partitioned data models. Additionally spatiotemporal and non-spatiotemporal indexing structures are designed for efficiently retrieving data under different usage scenarios. The results of the experiments exhibit the advantages of utilizing data models and indexing structures for various query types.

Download Full-text

Hybrid Mobile-App. on Multi-MEC Platforms in NFV Environment

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i4.38.24587 ◽

2018 ◽

Vol 7 (4.38) ◽

pp. 383

Author(s):

Jinsul Kim ◽

Akm Ashiquzzaman ◽

Van Quan Nguyen ◽

Sang Woo Kim

Keyword(s):

Big Data ◽

Data Storage ◽

Large Scale ◽

Web Applications ◽

Rapid Development ◽

Query Languages ◽

Vital Role ◽

Mobile App ◽

Network Function Virtualization ◽

Nosql Databases

In recent times, practicality of web applications has become more reliant upon big-data orientated materials such 4K videos, hi-def. resolution images, lossless audios and massive texts. Structured Query Languages (SQL) faces compatibility issues with large scale databases. Because of this data storage problem, NoSQL databases are used for storing big-data. NoSQL databases have been recently gaining traction with many options such MongoDB, CouchDB, Redis and Apache Cassandra. One of the major restrictions companies, enterprises and developers encounter during developing an application is multiplicative cost of building a native programing across different platforms. Besides, network Function Virtualization (NFV) plays a vital role for providing services for utilizing such applications in larger and more effective scale. Hence, in this paper, we discussed our main motivation behind selecting Iconic Framework, a hybrid system for rapid development real-time application based on Firebase in the NFV environment cooperating with Mobile Edge Computing (MEC). As a result, this approach provides comparatively flexible features.

Download Full-text

Comparison between SQL and NoSQL Databases and Their Relationship with Big Data Analytics

Asian Journal of Research in Computer Science ◽

10.9734/ajrcos/2019/v4i230108 ◽

2019 ◽

pp. 1-10 ◽

Cited By ~ 5

Author(s):

Wajid Ali ◽

Muhammad Usman Shafique ◽

Muhammad Arslan Majeed ◽

Ali Raza

Keyword(s):

Big Data ◽

Data Storage ◽

Data Analytics ◽

Relational Databases ◽

Web Applications ◽

Distributed Storage ◽

Big Data Analytics ◽

Nosql Databases ◽

Advantages And Disadvantages ◽

The World

A key ingredient in the world of cloud computing is a database that can be used by a great number of users. Distributed storage mechanisms become the de-facto method for data storage used by companies for the new generation of web applications. In the world of data storage, NoSQL (usually interpreted as "not only SQL" by developers) database is a growing trend. It is said that NoSQL alternates with the most widely used relational databases for the data storage, but, as the name implies, it does not fully replace the SQL. In this paper we will discuss about SQL and NoSQL databases, comparison of traditional SQL with NoSQL databases for Big Data analytics, NoSQL data models, types of NoSQL data stores, characteristics and features of each data store, advantages and disadvantages of NoSQL and RDBMS.

Download Full-text

Formalizing the Mapping of UML Conceptual Schemas to Column-Oriented Databases

International Journal of Data Warehousing and Mining ◽

10.4018/ijdwm.2018070103 ◽

2018 ◽

Vol 14 (3) ◽

pp. 44-68 ◽

Cited By ~ 1

Author(s):

Fatma Abdelhedi ◽

Amal Ait Brahim ◽

Gilles Zurfluh

Keyword(s):

Big Data ◽

Data Warehouse ◽

Relational Databases ◽

Traditional Approach ◽

Physical Models ◽

Decision Making Process ◽

Nosql Databases ◽

Care Field ◽

Sufficient Degree

Nowadays, most organizations need to improve their decision-making process using Big Data. To achieve this, they have to store Big Data, perform an analysis, and transform the results into useful and valuable information. To perform this, it's necessary to deal with new challenges in designing and creating data warehouse. Traditionally, creating a data warehouse followed well-governed process based on relational databases. The influence of Big Data challenged this traditional approach primarily due to the changing nature of data. As a result, using NoSQL databases has become a necessity to handle Big Data challenges. In this article, the authors show how to create a data warehouse on NoSQL systems. They propose the Object2NoSQL process that generates column-oriented physical models starting from a UML conceptual model. To ensure efficient automatic transformation, they propose a logical model that exhibits a sufficient degree of independence so as to enable its mapping to one or more column-oriented platforms. The authors provide experiments of their approach using a case study in the health care field.

Download Full-text

NoSQL Databases

Advances in Data Mining and Database Management - Handbook of Research on Cloud Infrastructures for Big Data Analytics ◽

10.4018/978-1-4666-5864-6.ch008 ◽

2014 ◽

pp. 186-215 ◽

Cited By ~ 2

Author(s):

Ganesh Chandra Deka

Keyword(s):

Cloud Computing ◽

Big Data ◽

Data Processing ◽

Open Source ◽

Data Storage ◽

Big Data Processing ◽

Nosql Databases ◽

Data Intensive ◽

Huge Data ◽

Data Intensive Applications

NoSQL databases are designed to meet the huge data storage requirements of cloud computing and big data processing. NoSQL databases have lots of advanced features in addition to the conventional RDBMS features. Hence, the “NoSQL” databases are popularly known as “Not only SQL” databases. A variety of NoSQL databases having different features to deal with exponentially growing data-intensive applications are available with open source and proprietary option. This chapter discusses some of the popular NoSQL databases and their features on the light of CAP theorem.

Download Full-text

Towards Massive RDF Storage in NoSQL Databases

Advances in Data Mining and Database Management - Emerging Technologies and Applications in Data Processing and Management ◽

10.4018/978-1-5225-8446-9.ch013 ◽

2019 ◽

pp. 263-284 ◽

Cited By ~ 2

Author(s):

Zongmin Ma ◽

Li Yan

Keyword(s):

Data Storage ◽

Large Scale ◽

Future Research ◽

Nosql Databases ◽

Current State ◽

Data Store ◽

Rdf Data ◽

Description Framework ◽

Resource Description ◽

The Web

The resource description framework (RDF) is a model for representing information resources on the web. With the widespread acceptance of RDF as the de-facto standard recommended by W3C (World Wide Web Consortium) for the representation and exchange of information on the web, a huge amount of RDF data is being proliferated and becoming available. So, RDF data management is of increasing importance and has attracted attention in the database community as well as the Semantic Web community. Currently, much work has been devoted to propose different solutions to store large-scale RDF data efficiently. In order to manage massive RDF data, NoSQL (not only SQL) databases have been used for scalable RDF data store. This chapter focuses on using various NoSQL databases to store massive RDF data. An up-to-date overview of the current state of the art in RDF data storage in NoSQL databases is provided. The chapter aims at suggestions for future research.

Download Full-text

A Review of RDF Storage in NoSQL Databases

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Managing Big Data in Cloud Computing Environments ◽

10.4018/978-1-4666-9834-5.ch009 ◽

2016 ◽

pp. 210-229 ◽

Cited By ~ 2

Author(s):

Zongmin Ma ◽

Li Yan

Keyword(s):

Data Storage ◽

Large Scale ◽

Future Research ◽

Nosql Databases ◽

Current State ◽

Data Store ◽

Rdf Data ◽

Description Framework ◽

Resource Description ◽

The Web

The Resource Description Framework (RDF) is a model for representing information resources on the Web. With the widespread acceptance of RDF as the de-facto standard recommended by W3C (World Wide Web Consortium) for the representation and exchange of information on the Web, a huge amount of RDF data is being proliferated and becoming available. So RDF data management is of increasing importance, and has attracted attentions in the database community as well as the Semantic Web community. Currently much work has been devoted to propose different solutions to store large-scale RDF data efficiently. In order to manage massive RDF data, NoSQL (“not only SQL”) databases have been used for scalable RDF data store. This chapter focuses on using various NoSQL databases to store massive RDF data. An up-to-date overview of the current state of the art in RDF data storage in NoSQL databases is provided. The chapter aims at suggestions for future research.

Download Full-text

Performance Analysis of NoSQL and Relational Databases with CouchDB and MySQL for Application’s Data Storage

Applied Sciences ◽

10.3390/app10238524 ◽

2020 ◽

Vol 10 (23) ◽

pp. 8524

Author(s):

Cornelia A. Győrödi ◽

Diana V. Dumşe-Burescu ◽

Doina R. Zmaranda ◽

Robert Ş. Győrödi ◽

Gianina A. Gabor ◽

...

Keyword(s):

Big Data ◽

Data Storage ◽

Relational Databases ◽

Database Systems ◽

Application Performance ◽

Big Data Applications ◽

Database Technology ◽

Important Challenge ◽

Big Data Application ◽

The Impact

In the current context of emerging several types of database systems (relational and non-relational), choosing the type and database system for storing large amounts of data in today’s big data applications has become an important challenge. In this paper, we aimed to provide a comparative evaluation of two popular open-source database management systems (DBMSs): MySQL as a relational DBMS and, more recently, as a non-relational DBMS, and CouchDB as a non-relational DBMS. This comparison was based on performance evaluation of CRUD (CREATE, READ, UPDATE, DELETE) operations for different amounts of data to show how these two databases could be modeled and used in an application and highlight the differences in the response time and complexity. The main objective of the paper was to make a comparative analysis of the impact that each specific DBMS has on application performance when carrying out CRUD requests. To perform the analysis and to ensure the consistency of tests, two similar applications were developed in Java, one using MySQL and the other one using CouchDB database; these applications were further used to evaluate the time responses for each database technology on the same CRUD operations on the database. Finally, a comprehensive discussion based on the results of the analysis was performed that centered on the results obtained and several conclusions were revealed. Advantages and drawbacks for each DBMS are outlined to support a decision for choosing a specific type of DBMS that could be used in a big data application.

Download Full-text

When Relational-Based Applications Go to NoSQL Databases: A Survey

Information ◽

10.3390/info10070241 ◽

2019 ◽

Vol 10 (7) ◽

pp. 241

Author(s):

Geomar A. Schreiner ◽

Denio Duarte ◽

Ronaldo dos S. Melo

Keyword(s):

Big Data ◽

Comparative Analysis ◽

Relational Databases ◽

Research Area ◽

Relational Data ◽

Data Sets ◽

System Architectures ◽

Nosql Databases ◽

State Of Art ◽

Open Issues

Several data-centric applications today produce and manipulate a large volume of data, the so-called Big Data. Traditional databases, in particular, relational databases, are not suitable for Big Data management. As a consequence, some approaches that allow the definition and manipulation of large relational data sets stored in NoSQL databases through an SQL interface have been proposed, focusing on scalability and availability. This paper presents a comparative analysis of these approaches based on an architectural classification that organizes them according to their system architectures. Our motivation is that wrapping is a relevant strategy for relational-based applications that intend to move relational data to NoSQL databases (usually maintained in the cloud). We also claim that this research area has some open issues, given that most approaches deal with only a subset of SQL operations or give support to specific target NoSQL databases. Our intention with this survey is, therefore, to contribute to the state-of-art in this research area and also provide a basis for choosing or even designing a relational-to-NoSQL data wrapping solution.

Download Full-text