Columnar storage and list-based processing for graph database management systems

We revisit column-oriented storage and query processing techniques in the context of contemporary graph database management systems (GDBMSs). Similar to column-oriented RDBMSs, GDBMSs support read-heavy analytical workloads that however have fundamentally different data access patterns than traditional analytical workloads. We first derive a set of desiderata for optimizing storage and query processors of GDBMS based on their access patterns. We then present the design of columnar storage, compression, and query processing techniques based on these desiderata. In addition to showing direct integration of existing techniques from columnar RDBMSs, we also propose novel ones that are optimized for GDBMSs. These include a novel list-based query processor, which avoids expensive data copies of traditional block-based processors under many-to-many joins, a new data structure we call single-indexed edge property pages and an accompanying edge ID scheme, and a new application of Jacobson's bit vector index for compressing NULL values and empty lists. We integrated our techniques into the GraphflowDB in-memory GDBMS. Through extensive experiments, we demonstrate the scalability and query performance benefits of our techniques.

Download Full-text

Query processing in main memory database management systems

Proceedings of the 1986 ACM SIGMOD international conference on Management of data - SIGMOD '86 ◽

10.1145/16894.16878 ◽

1986 ◽

Cited By ~ 60

Author(s):

Tobin J. Lehman ◽

Michael J. Carey

Keyword(s):

Query Processing ◽

Database Management ◽

Database Management Systems ◽

Main Memory ◽

Management Systems ◽

Main Memory Database

Download Full-text

Query processing in main memory database management systems

ACM SIGMOD Record ◽

10.1145/16856.16878 ◽

1986 ◽

Vol 15 (2) ◽

pp. 239-250 ◽

Cited By ~ 16

Author(s):

Tobin J. Lehman ◽

Michael J. Carey

Keyword(s):

Query Processing ◽

Database Management ◽

Database Management Systems ◽

Main Memory ◽

Management Systems ◽

Main Memory Database

Download Full-text

A+ Indexes: Tunable and Space-Efficient Adjacency Lists in Graph Database Management Systems

2021 IEEE 37th International Conference on Data Engineering (ICDE) ◽

10.1109/icde51399.2021.00130 ◽

2021 ◽

Author(s):

Amine Mhedhbi ◽

Pranjal Gupta ◽

Shahid Khaliq ◽

Semih Salihoglu

Keyword(s):

Database Management ◽

Database Management Systems ◽

Management Systems ◽

Graph Database

Download Full-text

Which Category Is Better: Benchmarking Relational and Graph Database Management Systems

Data Science and Engineering ◽

10.1007/s41019-019-00110-3 ◽

2019 ◽

Vol 4 (4) ◽

pp. 309-322 ◽

Cited By ~ 2

Author(s):

Yijian Cheng ◽

Pengjie Ding ◽

Tongtong Wang ◽

Wei Lu ◽

Xiaoyong Du

Keyword(s):

Database Management ◽

Database Management Systems ◽

Management Systems ◽

Graph Database ◽

Graph Data ◽

First Choice ◽

Data Graph ◽

Relational Database Management ◽

Relational Database Management Systems ◽

Aggregation Operations

Abstract Over decades, relational database management systems (RDBMSs) have been the first choice to manage data. Recently, due to the variety properties of big data, graph database management systems (GDBMSs) have emerged as an important complement to RDBMSs. As pointed out in the existing literature, both RDBMSs and GDBMSs are capable of managing graph data and relational data; however, the boundaries of them still remain unclear. For this reason, in this paper, we first extend a unified benchmark for RDBMSs and GDBMSs over the same datasets using the same query workload under the same metrics. We then conduct extensive experiments to evaluate them and make the following findings: (1) RDBMSs outperform GDMBSs by a substantial margin under the workloads which mainly consist of group by, sort, and aggregation operations, and their combinations; (2) GDMBSs show their superiority under the workloads that mainly consist of multi-table join, pattern match, path identification, and their combinations.

Download Full-text

Ontology Based Query Processing in Database Management Systems

On The Move to Meaningful Internet Systems 2003: CoopIS, DOA, and ODBASE - Lecture Notes in Computer Science ◽

10.1007/978-3-540-39964-3_53 ◽

2003 ◽

pp. 839-857 ◽

Cited By ~ 9

Author(s):

Chokri Ben Necib ◽

Johann-Christoph Freytag

Keyword(s):

Query Processing ◽

Database Management ◽

Database Management Systems ◽

Management Systems

Download Full-text

Secure query processing in distributed database management systems-design and performance studies

[1990] Proceedings of the Sixth Annual Computer Security Applications Conference ◽

10.1109/csac.1990.143756 ◽

2002 ◽

Cited By ~ 1

Author(s):

B. Thuraisingham ◽

A. Kamon

Keyword(s):

Query Processing ◽

Performance Studies ◽

Database Management ◽

Systems Design ◽

Distributed Database ◽

Database Management Systems ◽

Management Systems ◽

And Performance ◽

Distributed Database Management ◽

Secure Query Processing

Download Full-text

Usage of graph databases for social graph modeling

Bulletin of V.N. Karazin Kharkiv National University, series «Mathematical modeling. Information technology. Automated control systems» ◽

10.26565/2304-6201-2019-43-06 ◽

2019 ◽

Keyword(s):

Social Network ◽

Database Management ◽

Query Languages ◽

Database Management Systems ◽

Management Systems ◽

Graph Database ◽

Graph Modeling ◽

The Social ◽

Execution Speed ◽

To Receive

This article is devoted to graph database management systems. The main characteristics and capabilities of those systems have been contemplated. The problems that may occur during the social network development have been selected to be solved using a graph data model. The most popular database management systems nowadays, namely, Neo4J, OrientDB and ArangoDB have been chosen for the study. Such characteristics of the selected databases as whether the software is proprietary or freely distributed, whether databases have up-to-date documentation or not, whether they are supported by developers, whether there is a community where you can get answers to your questions, and how much time is needed to master the database have been elaborated. The typical social network queries, when you need to receive results with a large depth of search quickly, have been developed using the query languages Cypher, OrientDB SQL and AQL used in Neo4J, OrientDB and ArangoDB respectively. The comparison of query execution speed has been performed for the selected databases. For this purpose, a graph that has 5000 nodes and 24900 connections has been built by implementing the Barabashi-Albert model for generating random-scale networks. The test tasks for finding friends of three users with the depth of 5 have been generated. The average time for each request has been estimated for several executions. The conclusions have been drawn and the recommendations regarding the selection of the best graph database for social network implementation have been made.

Download Full-text

A Range Query Processing Algorithm Hiding Data Access Patterns in Outsourced Database Environment

Data Mining and Big Data - Lecture Notes in Computer Science ◽

10.1007/978-3-319-40973-3_44 ◽

2016 ◽

pp. 434-446 ◽

Cited By ~ 3

Author(s):

Hyeong-Il Kim ◽

Hyeong-Jin Kim ◽

Jae-Woo Chang

Keyword(s):

Query Processing ◽

Data Access ◽

Range Query ◽

Processing Algorithm ◽

Data Access Patterns ◽

Access Patterns ◽

Outsourced Database ◽

Range Query Processing

Download Full-text

GRAPH DATABASE MANAGEMENT SYSTEMS AND GRAPH THEORY

Conference Proceedings (part of ITEMA conference collection) ◽

10.31410/itema.2020.39 ◽

2020 ◽

Author(s):

Kornelije Rabuzin ◽

◽

Sonja Ristić ◽

Robert Kudelić ◽

◽

...

Keyword(s):

Graph Theory ◽

Data Model ◽

Database Management ◽

Database Management Systems ◽

Management Systems ◽

Graph Database ◽

Graph Databases ◽

Graph Data ◽

Model Based

In recent years, graph databases have become far more important. They have been proven to be an excellent choice for storing and managing large amounts of interconnected data. Since graph databases (GDB) rely on a graph data model based on graph theory, this study examines whether currently available graph database management systems support the principles of graph theory, and, if so, to what extent. We also show how these systems differ in terms of implementation and languages, and we also discuss which graph database management systems are used today and why.

Download Full-text