The impact of hardware on database systems

Author(s):  
Patricia G. Selinger


2016 ◽  
pp. 1495-1518
Author(s):  
Mohammad Alaa Hussain Al-Hamami
Keyword(s):  
Big Data

Big Data comprises the systems and emerging techniques that organizations must adopt to remain competitive. Big Data includes structured, semi-structured, and unstructured data. Structured data are data formatted for use in a database management system, while semi-structured and unstructured data include all types of unformatted data, including multimedia and social media content. Among practitioners and applied researchers, the reaction to data available through blogs, Twitter, Facebook, and other social media can be described as a “data rush” promising new insights about consumers' choices, behavior, and many other issues. In the past, Big Data was used only by very large organizations, governments, and large enterprises that had the ability to create their own infrastructure for hosting and mining large amounts of data. This chapter shows the requirements for protecting Big Data environments with the same rigorous security strategies applied to traditional database systems.


2020 ◽  
Vol 10 (23) ◽  
pp. 8524
Author(s):  
Cornelia A. Győrödi ◽  
Diana V. Dumşe-Burescu ◽  
Doina R. Zmaranda ◽  
Robert Ş. Győrödi ◽  
Gianina A. Gabor ◽  
...  

In the current context, in which several types of database systems (relational and non-relational) are emerging, choosing the type of database system for storing large amounts of data in today’s big data applications has become an important challenge. In this paper, we aimed to provide a comparative evaluation of two popular open-source database management systems (DBMSs): MySQL as a relational DBMS (which has more recently also gained non-relational capabilities) and CouchDB as a non-relational DBMS. This comparison was based on a performance evaluation of CRUD (CREATE, READ, UPDATE, DELETE) operations for different amounts of data, to show how these two databases can be modeled and used in an application and to highlight the differences in response time and complexity. The main objective of the paper was a comparative analysis of the impact that each specific DBMS has on application performance when carrying out CRUD requests. To perform the analysis and to ensure the consistency of the tests, two similar applications were developed in Java, one using MySQL and the other using CouchDB; these applications were then used to measure the response times of each database technology for the same CRUD operations. Finally, a comprehensive discussion centered on the results of the analysis was carried out and several conclusions were drawn. The advantages and drawbacks of each DBMS are outlined to support the decision of choosing a specific type of DBMS for a big data application.
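
As an illustration of how such a CRUD benchmark can be instrumented, the minimal Java sketch below times batched INSERTs and a filtered SELECT against MySQL over JDBC. The connection URL, credentials, and the `customers` table are hypothetical placeholders, and MySQL Connector/J is assumed to be on the classpath; this is a sketch of the measurement approach, not the authors' actual test application. The CouchDB leg of such a comparison would follow the same timing pattern but issue requests against CouchDB's HTTP API instead.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.PreparedStatement;
import java.sql.ResultSet;

public class CrudTimingDemo {
    public static void main(String[] args) throws Exception {
        // Hypothetical connection details; adjust to the test environment.
        try (Connection conn = DriverManager.getConnection(
                "jdbc:mysql://localhost:3306/testdb", "user", "password")) {

            // CREATE: time a batch of inserts.
            long t0 = System.nanoTime();
            try (PreparedStatement ps = conn.prepareStatement(
                    "INSERT INTO customers (name, city) VALUES (?, ?)")) {
                for (int i = 0; i < 10_000; i++) {
                    ps.setString(1, "name-" + i);
                    ps.setString(2, "city-" + (i % 100));
                    ps.addBatch();
                }
                ps.executeBatch();
            }
            System.out.printf("INSERT: %.1f ms%n", (System.nanoTime() - t0) / 1e6);

            // READ: time a filtered query and drain its results.
            t0 = System.nanoTime();
            try (PreparedStatement ps = conn.prepareStatement(
                    "SELECT name FROM customers WHERE city = ?")) {
                ps.setString(1, "city-42");
                try (ResultSet rs = ps.executeQuery()) {
                    while (rs.next()) rs.getString(1);
                }
            }
            System.out.printf("SELECT: %.1f ms%n", (System.nanoTime() - t0) / 1e6);
        }
    }
}
```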


2021 ◽  
Vol 19 ◽  
pp. 151-158
Author(s):  
Piotr Rymarski ◽  
Grzegorz Kozieł

Most of today's web applications run on relational database systems. Communication with them is possible through statements written in Structured Query Language (SQL). This paper presents the most popular relational database management systems and describes common ways to optimize SQL queries. Using a research environment based on a fragment of the imdb.com database and running on the OracleDb, MySQL, Microsoft SQL Server, and PostgreSQL engines, a number of test scenarios were performed. The aim was to examine the performance changes of SQL queries resulting from syntax modification while keeping the result set the same, as well as the impact of database organization, indexing, and the advanced efficiency-oriented mechanisms delivered with the systems used. The tests were carried out using a proprietary application written in Java using the Hibernate framework.
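
To make the idea of result-preserving syntax rewrites concrete, the JDBC sketch below times two equivalent formulations of the same question (an IN subquery versus a JOIN) and then re-times the JOIN after adding an index. The `titles`/`ratings` schema and the PostgreSQL connection details are hypothetical stand-ins for the imdb.com fragment used in the paper; this is not the proprietary Hibernate-based application described above.

```java
import java.sql.*;

public class QueryVariantTiming {
    // Runs a query several times and reports the best run, a common way
    // to reduce noise when comparing syntactic variants.
    static double bestRunMillis(Connection conn, String sql) throws SQLException {
        double best = Double.MAX_VALUE;
        for (int run = 0; run < 5; run++) {
            long t0 = System.nanoTime();
            try (Statement st = conn.createStatement();
                 ResultSet rs = st.executeQuery(sql)) {
                while (rs.next()) { /* drain results */ }
            }
            best = Math.min(best, (System.nanoTime() - t0) / 1e6);
        }
        return best;
    }

    public static void main(String[] args) throws Exception {
        try (Connection conn = DriverManager.getConnection(
                "jdbc:postgresql://localhost:5432/imdb", "user", "password")) {
            // Two syntactic variants of the same logical question:
            // titles that have at least one rating.
            String subquery = "SELECT t.title FROM titles t " +
                    "WHERE t.id IN (SELECT r.title_id FROM ratings r)";
            String join = "SELECT DISTINCT t.title FROM titles t " +
                    "JOIN ratings r ON r.title_id = t.id";
            System.out.printf("IN subquery:    %.1f ms%n", bestRunMillis(conn, subquery));
            System.out.printf("JOIN:           %.1f ms%n", bestRunMillis(conn, join));

            // Re-run after adding an index to observe its impact.
            try (Statement st = conn.createStatement()) {
                st.execute("CREATE INDEX IF NOT EXISTS idx_ratings_title ON ratings(title_id)");
            }
            System.out.printf("JOIN (indexed): %.1f ms%n", bestRunMillis(conn, join));
        }
    }
}
```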


2020 ◽  
Vol 13 (7) ◽  
pp. 1064-1077 ◽  
Author(s):  
Supreeth Shastri ◽  
Vinay Banakar ◽  
Melissa Wasserman ◽  
Arun Kumar ◽  
Vijay Chidambaram

1996 ◽  
Vol 7 (2) ◽  
pp. 24-33
Author(s):  
Victor C.S. Lee ◽  
Kam-Yiu Lam ◽  
Kwok-Wa Lam ◽  
Joseph K.Y. Ng

Author(s):  
David Gamero ◽  
Andrew Dugenske ◽  
Thomas Kurfess ◽  
Christopher Saldana ◽  
Katherine Fu

In this paper, the design and performance differences between Relational Database Management Systems (RDBMS) and NoSQL database systems are examined, with attention to their applicability to real-world Internet of Things for manufacturing (IoTfM) data. While previous work has extensively compared SQL and NoSQL for both generalized and IoT uses, this work specifically examines the tradeoffs and performance differences for manufacturing applications by using a high-fidelity data set collected from a large US manufacturing firm. Growing an IoT system beyond the pilot stage requires scalable data storage; this work seeks to determine the impact of the selected database systems on data write performance at scale. Payload size and message frequency were used as the primary characteristics to maintain model fidelity in the simulated clients. As the number of simulated asset clients grows, the data write latency was measured to determine how the performance of both database systems is affected. To isolate the RDBMS and NoSQL differences, a cloud environment was created using Amazon Web Services (AWS) with two identical data ingestion pipelines: (1) writing data to an RDBMS using AWS Aurora MySQL, and (2) writing data to NoSQL using AWS DynamoDB. The findings may provide guidance for further experimentation in large-scale manufacturing IoT implementations.
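
The write-latency measurement described above can be sketched with a small, self-contained Java harness: a fixed pool of simulated clients writes fixed-size payloads at a fixed message frequency through a pluggable sink, and the mean write latency is reported. The client count, payload size, and frequency below are illustrative values, and the `DataSink` stand-in replaces the real pipelines (Aurora MySQL via JDBC, DynamoDB via the AWS SDK), which are omitted; this is an assumed reconstruction of the approach, not the authors' code.

```java
import java.util.concurrent.*;
import java.util.concurrent.atomic.AtomicLong;

public class WriteLatencyHarness {
    // Pluggable sink: one implementation would write to Aurora MySQL via
    // JDBC, another to DynamoDB via the AWS SDK; both are omitted here.
    interface DataSink { void write(byte[] payload) throws Exception; }

    public static void main(String[] args) throws Exception {
        int clients = 50;             // number of simulated asset clients
        int messagesPerClient = 200;  // messages each client sends
        int payloadBytes = 512;       // payload size modeled on real telemetry
        long periodMillis = 100;      // message frequency: one write per 100 ms

        DataSink sink = payload -> Thread.sleep(2); // stand-in for a real write
        ExecutorService pool = Executors.newFixedThreadPool(clients);
        AtomicLong totalLatencyNanos = new AtomicLong();
        AtomicLong writes = new AtomicLong();

        for (int c = 0; c < clients; c++) {
            pool.submit(() -> {
                byte[] payload = new byte[payloadBytes];
                for (int m = 0; m < messagesPerClient; m++) {
                    try {
                        long t0 = System.nanoTime();
                        sink.write(payload);
                        totalLatencyNanos.addAndGet(System.nanoTime() - t0);
                        writes.incrementAndGet();
                        Thread.sleep(periodMillis); // hold message frequency constant
                    } catch (Exception e) {
                        Thread.currentThread().interrupt();
                        return;
                    }
                }
            });
        }
        pool.shutdown();
        pool.awaitTermination(10, TimeUnit.MINUTES);
        System.out.printf("mean write latency: %.2f ms over %d writes%n",
                totalLatencyNanos.get() / 1e6 / writes.get(), writes.get());
    }
}
```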


2021 ◽  
Author(s):  
Utku Sirin ◽  
Pınar Tözün ◽  
Danica Porobic ◽  
Ahmad Yasin ◽  
Anastasia Ailamaki

Micro-architectural behavior of traditional disk-based online transaction processing (OLTP) systems has been investigated extensively over the past couple of decades. Results show that traditional OLTP systems mostly under-utilize the available micro-architectural resources. In-memory OLTP systems, on the other hand, process all the data in main memory and, therefore, can omit the buffer pool. Furthermore, they usually adopt more lightweight concurrency control mechanisms, cache-conscious data structures, and cleaner codebases, since they are usually designed from scratch. Hence, we expect significant differences in micro-architectural behavior when running OLTP on platforms optimized for in-memory processing as opposed to disk-based database systems. In particular, we expect that in-memory systems exploit micro-architectural features such as instruction and data caches significantly better than disk-based systems. This paper sheds light on the micro-architectural behavior of in-memory database systems by analyzing it and contrasting it with the behavior of disk-based systems when running OLTP workloads. The results show that, despite all the design changes, in-memory OLTP exhibits very similar micro-architectural behavior to disk-based OLTP: more than half of the execution time goes to memory stalls, where instruction cache misses or the long-latency data misses from the last-level cache (LLC) are the dominant factors in the overall execution time. Even though ground-up designed in-memory systems can eliminate the instruction cache misses, the reduction in instruction stalls amplifies the impact of LLC data misses. As a result, only 30% of the CPU cycles are used to retire instructions, and 70% of the CPU cycles are wasted on stalls for both traditional disk-based and new-generation in-memory OLTP.
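
To make the reported cycle breakdown concrete, the short Java example below turns raw cycle-counter values into the retiring/stall percentages the paper discusses. The counter values are made-up illustrative assumptions chosen to mirror the roughly 30% retiring / 70% stalled split, not measurements from the study.

```java
public class CycleBreakdown {
    public static void main(String[] args) {
        // Illustrative counter values only (not from the paper's experiments).
        long totalCycles   = 1_000_000_000L;
        long icacheStalls  =   280_000_000L; // cycles stalled on instruction fetch
        long llcDataStalls =   330_000_000L; // cycles stalled on LLC data misses
        long otherStalls   =    90_000_000L;

        long stalled = icacheStalls + llcDataStalls + otherStalls;
        long retiring = totalCycles - stalled;
        System.out.printf("retiring: %4.1f%%%n", 100.0 * retiring / totalCycles);
        System.out.printf("stalled:  %4.1f%% (I-cache %.1f%%, LLC data %.1f%%, other %.1f%%)%n",
                100.0 * stalled / totalCycles,
                100.0 * icacheStalls / totalCycles,
                100.0 * llcDataStalls / totalCycles,
                100.0 * otherStalls / totalCycles);
    }
}
```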


2019 ◽  
Vol 9 (19) ◽  
pp. 4103 ◽  
Author(s):  
Hema Sekhar Reddy Rajula ◽  
Veronika Odintsova ◽  
Mirko Manchia ◽  
Vassilios Fanos

Cohorts are instrumental for epidemiologically oriented observational studies. Cohort studies usually observe large groups of individuals for a specific period of time to identify the factors contributing to a specific outcome (for instance, an illness) and to establish associations between risk factors and the outcome under study. In collaborative projects, federated data facilities are meta-database systems, distributed across multiple locations, that make it possible to analyze, combine, or harmonize data from different sources, making them suitable for mega- and meta-analyses. The harmonization of data can increase the statistical power of studies through maximization of sample size, allowing for additional refined statistical analyses and ultimately making it possible to answer research questions that could not be addressed using a single study. Indeed, harmonized data can be analyzed through mega-analysis of raw data or fixed-effect meta-analysis. Other types of data might be analyzed by, e.g., random-effects meta-analysis or Bayesian evidence synthesis. In this article, we describe some methodological aspects related to the construction of a federated facility to optimize analyses of multiple datasets, the impact of missing data, and some methods for handling missing data in cohort studies.
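
For reference, the fixed-effect (inverse-variance) meta-analysis mentioned above combines the per-study estimates \(\hat\theta_i\) using weights equal to the reciprocals of their estimated variances \(\hat\sigma_i^2\), for \(k\) studies. This is the standard textbook formulation rather than anything specific to the federated facility described in the article:

```latex
\hat\theta_{\mathrm{FE}} = \frac{\sum_{i=1}^{k} w_i\, \hat\theta_i}{\sum_{i=1}^{k} w_i},
\qquad w_i = \frac{1}{\hat\sigma_i^{2}},
\qquad \operatorname{Var}\!\left(\hat\theta_{\mathrm{FE}}\right) = \frac{1}{\sum_{i=1}^{k} w_i}
```

A mega-analysis would instead fit a single model across all pooled, harmonized raw records, which is why harmonization can increase statistical power relative to analyzing any single study.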

