Data Models in NoSQL Databases for Big Data Contexts

SQL and NoSQL Databases in the Context of Industry 4.0

10.20944/preprints202111.0019.v1 ◽

2021 ◽

Author(s):

Vitor Furlan de Oliveira ◽

Marcosiris Amorim de Oliveira Pessoa ◽

Fabrício Junqueira ◽

Paulo Eigi Miyagi

Keyword(s):

Big Data ◽

Data Storage ◽

Industry 4.0 ◽

Transformation Process ◽

Data Models ◽

Technological Transformation ◽

Nosql Databases ◽

Five Dimensions ◽

Advantages And Disadvantages ◽

Logical Data

The data-oriented paradigm has proven to be fundamental for the technological transformation process that characterizes Industry 4.0 (I4.0) so that Big Data & Analytics is considered a technological pillar of this process. The literature reports a series of system architecture proposals that seek to implement the so-called Smart Factory, which is primarily data-driven. Many of these proposals treat data storage solutions as mere entities that support the architecture's functionalities. However, choosing which logical data model to use can significantly affect the performance of the architecture. This work identifies the advantages and disadvantages of relational (SQL) and non-relational (NoSQL) data models for I4.0, taking into account the nature of the data in this process. The characterization of data in the context of I4.0 is based on the five dimensions of Big Data and a standardized format for representing information of assets in the virtual world, the Asset Administration Shell. This work allows identifying appropriate transactional properties and logical data models according to the volume, variety, velocity, veracity, and value of the data. In this way, it is possible to describe the suitability of SQL and NoSQL databases for different scenarios within I4.0.

Download Full-text

SQL and NoSQL Databases in the Context of Industry 4.0

Machines ◽

10.3390/machines10010020 ◽

2021 ◽

Vol 10 (1) ◽

pp. 20

Author(s):

Vitor Furlan de Oliveira ◽

Marcosiris Amorim de Oliveira Pessoa ◽

Fabrício Junqueira ◽

Paulo Eigi Miyagi

Keyword(s):

Big Data ◽

Data Storage ◽

Industry 4.0 ◽

Manufacturing Systems ◽

Intelligent Manufacturing ◽

Transformation Process ◽

Data Models ◽

Nosql Databases ◽

Advantages And Disadvantages ◽

Logical Data

The data-oriented paradigm has proven to be fundamental for the technological transformation process that characterizes Industry 4.0 (I4.0) so that big data and analytics is considered a technological pillar of this process. The goal of I4.0 is the implementation of the so-called Smart Factory, characterized by Intelligent Manufacturing Systems (IMS) that overcome traditional manufacturing systems in terms of efficiency, flexibility, level of integration, digitalization, and intelligence. The literature reports a series of system architecture proposals for IMS, which are primarily data driven. Many of these proposals treat data storage solutions as mere entities that support the architecture’s functionalities. However, choosing which logical data model to use can significantly affect the performance of the IMS. This work identifies the advantages and disadvantages of relational (SQL) and non-relational (NoSQL) data models for I4.0, considering the nature of the data in this process. The characterization of data in the context of I4.0 is based on the five dimensions of big data and a standardized format for representing information of assets in the virtual world, the Asset Administration Shell. This work allows identifying appropriate transactional properties and logical data models according to the volume, variety, velocity, veracity, and value of the data. In this way, it is possible to describe the suitability of relational and NoSQL databases for different scenarios within I4.0.

Download Full-text

Evaluation of Banking Standards to ascertain their suitability for building Data Models for Big data based Data Lake for Banking Domain

10.1109/temsmet51618.2020.9557578 ◽

2020 ◽

Author(s):

Nirmal B. Satyendra ◽

Neeraja K. Swami ◽

Priyanka V. Bhailume

Keyword(s):

Big Data ◽

Data Models

Download Full-text

Benchmarking Big Data OLAP NoSQL Databases

Ubiquitous Networking - Lecture Notes in Computer Science ◽

10.1007/978-3-030-02849-7_8 ◽

2018 ◽

pp. 82-94 ◽

Cited By ~ 2

Author(s):

Mohammed El Malki ◽

Arlind Kopliku ◽

Essaid Sabir ◽

Olivier Teste

Keyword(s):

Big Data ◽

Nosql Databases

Download Full-text

Formalizing the Mapping of UML Conceptual Schemas to Column-Oriented Databases

International Journal of Data Warehousing and Mining ◽

10.4018/ijdwm.2018070103 ◽

2018 ◽

Vol 14 (3) ◽

pp. 44-68 ◽

Cited By ~ 1

Author(s):

Fatma Abdelhedi ◽

Amal Ait Brahim ◽

Gilles Zurfluh

Keyword(s):

Big Data ◽

Data Warehouse ◽

Relational Databases ◽

Traditional Approach ◽

Physical Models ◽

Decision Making Process ◽

Nosql Databases ◽

Care Field ◽

Sufficient Degree

Nowadays, most organizations need to improve their decision-making process using Big Data. To achieve this, they have to store Big Data, perform an analysis, and transform the results into useful and valuable information. To perform this, it's necessary to deal with new challenges in designing and creating data warehouse. Traditionally, creating a data warehouse followed well-governed process based on relational databases. The influence of Big Data challenged this traditional approach primarily due to the changing nature of data. As a result, using NoSQL databases has become a necessity to handle Big Data challenges. In this article, the authors show how to create a data warehouse on NoSQL systems. They propose the Object2NoSQL process that generates column-oriented physical models starting from a UML conceptual model. To ensure efficient automatic transformation, they propose a logical model that exhibits a sufficient degree of independence so as to enable its mapping to one or more column-oriented platforms. The authors provide experiments of their approach using a case study in the health care field.

Download Full-text

A Domain Specific Entity Linking Approach Consuming Multistore Environment

Journal of Intelligent Systems with Applications ◽

10.54856/jiswa.201805016 ◽

2018 ◽

pp. 46-52

Author(s):

Emrah Inan ◽

Burak Yonyul ◽

Fatih Tekbacak

Keyword(s):

Big Data ◽

Data Models ◽

Use Cases ◽

Unstructured Data ◽

Entity Linking ◽

Domain Specific ◽

Storage Technology ◽

Integrated Environment ◽

Data Environment ◽

The Web

Most of the data on the web is non-structural, and it is required that the data should be transformed into a machine operable structure. Therefore, it is appropriate to convert the unstructured data into a structured form according to the requirements and to store those data in different data models by considering use cases. As requirements and their types increase, it fails using one approach to perform on all. Thus, it is not suitable to use a single storage technology to carry out all storage requirements. Managing stores with various type of schemas in a joint and an integrated manner is named as 'multistore' and 'polystore' in the database literature. In this paper, Entity Linking task is leveraged to transform texts into wellformed data and this data is managed by an integrated environment of different data models. Finally, this integrated big data environment will be queried and be examined by presenting the method.

Download Full-text

NoSQL Databases

Advances in Data Mining and Database Management - Handbook of Research on Cloud Infrastructures for Big Data Analytics ◽

10.4018/978-1-4666-5864-6.ch008 ◽

2014 ◽

pp. 186-215 ◽

Cited By ~ 2

Author(s):

Ganesh Chandra Deka

Keyword(s):

Cloud Computing ◽

Big Data ◽

Data Processing ◽

Open Source ◽

Data Storage ◽

Big Data Processing ◽

Nosql Databases ◽

Data Intensive ◽

Huge Data ◽

Data Intensive Applications

NoSQL databases are designed to meet the huge data storage requirements of cloud computing and big data processing. NoSQL databases have lots of advanced features in addition to the conventional RDBMS features. Hence, the “NoSQL” databases are popularly known as “Not only SQL” databases. A variety of NoSQL databases having different features to deal with exponentially growing data-intensive applications are available with open source and proprietary option. This chapter discusses some of the popular NoSQL databases and their features on the light of CAP theorem.

Download Full-text

Big Data Analytics

Handbook of Research on Cloud Computing and Big Data Applications in IoT - Advances in Computer and Electrical Engineering ◽

10.4018/978-1-5225-8407-0.ch004 ◽

2019 ◽

pp. 67-81

Author(s):

Nitigya Sambyal ◽

Poonam Saini ◽

Rupali Syal

Keyword(s):

Big Data ◽

Open Problem ◽

Data Analytics ◽

Big Data Analytics ◽

Data Capture ◽

Data Sets ◽

Full Potential ◽

Processing Application ◽

Nosql Databases ◽

The World

The world is increasingly driven by huge amounts of data. Big data refers to data sets that are so large or complex that traditional data processing application software are inadequate to deal with them. Healthcare analytics is a prominent area of big data analytics. It has led to significant reduction in morbidity and mortality associated with a disease. In order to harness full potential of big data, various tools like Apache Sentry, BigQuery, NoSQL databases, Hadoop, JethroData, etc. are available for its processing. However, with such enormous amounts of information comes the complexity of data management, other big data challenges occur during data capture, storage, analysis, search, transfer, information privacy, visualization, querying, and update. The chapter focuses on understanding the meaning and concept of big data, analytics of big data, its role in healthcare, various application areas, trends and tools used to process big data along with open problem challenges.

Download Full-text

Modeling and Indexing Spatiotemporal Trajectory Data in Non-Relational Databases

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Managing Big Data in Cloud Computing Environments ◽

10.4018/978-1-4666-9834-5.ch006 ◽

2016 ◽

pp. 133-162 ◽

Cited By ~ 7

Author(s):

Berkay Aydin ◽

Vijay Akkineni ◽

Rafal A Angryk

Keyword(s):

Data Storage ◽

Relational Databases ◽

Large Scale ◽

Database Systems ◽

Distributed Database ◽

Data Models ◽

Trajectory Data ◽

Nosql Databases ◽

Distributed Database Systems ◽

Partitioned Data

With the ever-growing nature of spatiotemporal data, it is inevitable to use non-relational and distributed database systems for storing massive spatiotemporal datasets. In this chapter, the important aspects of non-relational (NoSQL) databases for storing large-scale spatiotemporal trajectory data are investigated. Mainly, two data storage schemata are proposed for storing trajectories, which are called traditional and partitioned data models. Additionally spatiotemporal and non-spatiotemporal indexing structures are designed for efficiently retrieving data under different usage scenarios. The results of the experiments exhibit the advantages of utilizing data models and indexing structures for various query types.

Download Full-text

New Trends in Databases to NonSQL Databases

Encyclopedia of Information Science and Technology, Fifth Edition - Advances in Information Quality and Management ◽

10.4018/978-1-7998-3479-3.ch054 ◽

2021 ◽

pp. 791-799

Author(s):

Antonio Sarasa-Cabezuelo

Keyword(s):

Big Data ◽

Information Processing ◽

Real Time ◽

Relational Databases ◽

Data Models ◽

Relational Model ◽

Alternative Mechanism ◽

New Models ◽

Database Models

The appearance of the “big data” phenomenon has meant a change in the storage and information processing needs. This new context is characterized by 1) enormous amounts of information are available in heterogeneous formats and types, 2) information must be processed almost in real time, and 3) data models evolve periodically. Relational databases have limitations to respond to these needs in an optimal way. For these reasons, some companies such as Google or Amazon decided to create new database models (different from the relational model) that solve the needs raised in the context of big data without the limitations of relational databases. These new models are the origin of the so-called NonSQL databases. Currently, NonSQL databases have been constituted as an alternative mechanism to the relational model and its use is widely extended. The main objective of this chapter is to introduce the NonSQL databases.

Download Full-text