Data-Centric Benchmarking

Author(s):  
Jérôme Darmont

In data management, both system designers and users routinely resort to performance evaluation. Performance evaluation by experimentation on a real system is generally referred to as benchmarking. The aim of this chapter is to present an overview of the major data-centric benchmarks, past and present. This review includes the TPC standard benchmarks, but also alternative or more specialized benchmarks. Surveyed benchmarks are categorized into three families: transaction benchmarks aimed at on-line transaction processing (OLTP), decision-support benchmarks aimed at on-line analytical processing (OLAP), and big data benchmarks. Issues, tradeoffs, and future trends in data-centric benchmarking are also discussed.
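As a minimal illustration of what "performance evaluation by experimentation" means in practice, the sketch below times a fixed batch of OLTP-style transactions and reports throughput. It is only a toy, assuming an in-memory SQLite database and an illustrative transfer workload; real transaction benchmarks such as TPC-C prescribe far richer schemas, workload mixes, and reporting rules.

```python
import sqlite3
import time

# Illustrative OLTP-style micro-benchmark (not TPC-C): time a fixed
# transactional workload and report throughput in transactions/second.
N_TRANSACTIONS = 10_000  # hypothetical workload size

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE account (id INTEGER PRIMARY KEY, balance INTEGER)")
conn.executemany("INSERT INTO account VALUES (?, ?)",
                 [(i, 1_000) for i in range(100)])
conn.commit()

start = time.perf_counter()
for i in range(N_TRANSACTIONS):
    # Each transaction moves one unit between two accounts.
    src, dst = i % 100, (i + 1) % 100
    conn.execute("UPDATE account SET balance = balance - 1 WHERE id = ?", (src,))
    conn.execute("UPDATE account SET balance = balance + 1 WHERE id = ?", (dst,))
    conn.commit()
elapsed = time.perf_counter() - start

print(f"{N_TRANSACTIONS / elapsed:.0f} transactions/second")
```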


Author(s):  
Jérôme Darmont

Performance evaluation is a key issue for designers and users of Database Management Systems (DBMSs). Performance is generally assessed with software benchmarks that help, for example, to test architectural choices, compare different technologies, or tune a system. In the particular context of data warehousing and On-Line Analytical Processing (OLAP), although the Transaction Processing Performance Council (TPC) aims at issuing standard decision-support benchmarks, few benchmarks actually exist. We present in this chapter the Data Warehouse Engineering Benchmark (DWEB), which allows generating various ad hoc synthetic data warehouses and workloads. DWEB is fully parameterized to fulfill various data warehouse design needs. However, two levels of parameterization keep it relatively easy to tune. We also expand on our previous work on DWEB by presenting its new Extract, Transform, and Load (ETL) feature, as well as its new execution protocol. A Java implementation of DWEB is freely available online and can be interfaced with most existing relational DBMSs. To the best of our knowledge, DWEB is the only easily available, up-to-date benchmark for data warehouses.
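DWEB itself is a parameterized Java tool; the short Python sketch below only mimics the underlying idea of generating a synthetic star schema (one fact table plus configurable dimensions) from a handful of high-level parameters. The parameter names and cardinalities are hypothetical, not DWEB's actual knobs.

```python
import random
import sqlite3

# Hypothetical high-level parameters, loosely inspired by DWEB's idea that
# a few knobs should drive the whole generated warehouse.
N_DIMENSIONS = 3          # number of dimension tables
DIM_CARDINALITY = 1_000   # rows per dimension
N_FACTS = 100_000         # rows in the fact table

conn = sqlite3.connect(":memory:")

# Generate the dimension tables.
for d in range(N_DIMENSIONS):
    conn.execute(f"CREATE TABLE dim{d} (id INTEGER PRIMARY KEY, label TEXT)")
    conn.executemany(f"INSERT INTO dim{d} VALUES (?, ?)",
                     [(i, f"d{d}_v{i}") for i in range(DIM_CARDINALITY)])

# Generate the fact table: one foreign key per dimension plus one measure.
fk_cols = ", ".join(f"dim{d}_id INTEGER" for d in range(N_DIMENSIONS))
conn.execute(f"CREATE TABLE fact ({fk_cols}, measure REAL)")
rows = [tuple(random.randrange(DIM_CARDINALITY) for _ in range(N_DIMENSIONS))
        + (random.random() * 100,)
        for _ in range(N_FACTS)]
placeholders = ", ".join("?" for _ in range(N_DIMENSIONS + 1))
conn.executemany(f"INSERT INTO fact VALUES ({placeholders})", rows)
conn.commit()

print(conn.execute("SELECT count(*) FROM fact").fetchone()[0], "facts generated")
```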


Author(s):  
Jérôme Darmont

Performance measurement tools are very important, both for designers and users of Database Management Systems (DBMSs). Performance evaluation is useful to designers to determine elements of architecture and, more generally, to validate or refute hypotheses regarding the actual behavior of a DBMS. Thus, performance evaluation is an essential component in the development process of well-designed and efficient systems. Users may also employ performance evaluation, either to compare the efficiency of different technologies before selecting a DBMS, or to tune a system. Performance evaluation by experimentation on a real system is generally referred to as benchmarking. It consists of performing a series of tests on a given DBMS to estimate its performance in a given setting. Typically, a benchmark comprises two main elements: a database model (conceptual schema and extension), and a workload model (a set of read and write operations) to apply on this database, following a predefined protocol. Most benchmarks also include a set of simple or composite performance metrics, such as response time, throughput, number of input/output operations, and disk or memory usage. The aim of this article is to present an overview of the major families of state-of-the-art database benchmarks, namely relational benchmarks, object and object-relational benchmarks, XML benchmarks, and decision-support benchmarks, and to discuss the issues, tradeoffs, and future trends in database benchmarking. We particularly focus on XML and decision-support benchmarks, which are currently the most innovative tools developed in this area.
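The decomposition above, a database model, a workload model, and performance metrics, can be made concrete with a small harness. The sketch below is a simplification under assumed names (it is not any standard benchmark): it populates a one-table database model, replays a fixed read/write workload, and derives two of the metrics mentioned in the abstract, response time and throughput.

```python
import sqlite3
import statistics
import time

# Database model: a conceptual schema (one table) and its extension (rows).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE item (id INTEGER PRIMARY KEY, price REAL)")
conn.executemany("INSERT INTO item VALUES (?, ?)",
                 [(i, float(i)) for i in range(1_000)])
conn.commit()

# Workload model: a fixed mix of read and write operations (illustrative).
workload = [("SELECT avg(price) FROM item", ()),
            ("UPDATE item SET price = price * 1.01 WHERE id = ?", (42,))] * 500

# Protocol: replay the workload once, recording per-operation latency.
latencies = []
start = time.perf_counter()
for sql, params in workload:
    t0 = time.perf_counter()
    conn.execute(sql, params)
    latencies.append(time.perf_counter() - t0)
conn.commit()
elapsed = time.perf_counter() - start

# Metrics: mean response time and overall throughput.
print(f"mean response time: {statistics.mean(latencies) * 1e3:.3f} ms")
print(f"throughput: {len(workload) / elapsed:.0f} operations/second")
```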


Author(s):  
Jérôme Darmont
Emerson Olivier

In this context, the warehouse measures, though not necessarily numerical, remain the indicators for analysis, and analysis is still performed along different perspectives represented by dimensions. Large data volumes and their dating are other arguments in favor of this approach (Darmont et al., 2003). Data warehousing can also support various types of analysis, such as statistical reporting, on-line analytical processing (OLAP), and data mining. The aim of this article is to present an overview of existing data warehouses for biomedical data and to discuss the issues and future trends in biomedical data warehousing. We illustrate this topic by presenting the design of an innovative, complex data warehouse for personal, anticipative medicine.
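To illustrate the multidimensional view the abstract describes, measures summarized along analysis dimensions even when they are not numerical, the sketch below builds a tiny hypothetical biomedical dataset and aggregates a non-numerical measure by counting it along two dimensions. It assumes the third-party pandas library; column names and values are invented for illustration.

```python
import pandas as pd

# Hypothetical biomedical facts: each row is one examination record.
facts = pd.DataFrame({
    "patient_group": ["A", "A", "B", "B", "B", "A"],            # dimension 1
    "year": [2004, 2005, 2004, 2005, 2005, 2005],               # dimension 2
    "diagnosis": ["flu", "flu", "asthma",
                  "flu", "asthma", "asthma"],                   # non-numerical measure
})

# OLAP-style view: the non-numerical measure is summarized along the two
# analysis dimensions by counting occurrences in each cell.
cube = facts.pivot_table(index="patient_group", columns="year",
                         values="diagnosis", aggfunc="count")
print(cube)
```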


2006, Vol. 22(04), pp. 248-252
Author(s):
Song Sang
Hua-jun Li

A new decision support system (DSS) based on data warehouses is discussed. It combines data warehouses, on-line analytical processing (OLAP), and data mining, a combination that is new in the field of DSS. The essential principle, the construction of the model, the development environment, the main system interface, and a sketch of the theoretical framework of the new DSS architecture are also described. The decision support system was applied to data abstraction for evaluating ship form scenarios. Tests have shown it to be practical and dependable in complex settings, such as the demonstration of ship forms.

