Graph Pattern Index for Neo4j Graph Databases

Graph pattern matching, which seeks answer graphs in a data graph that match a given pattern graph, plays a fundamental role in graph search for graph databases. “Matching” means that the two graphs are related by some correspondence, such as simulation, bisimulation, or isomorphism. Bisimulation is stricter than simulation and looser than isomorphism. Little work has been done on searching for bisimilar subgraphs, and this research focuses on that problem. The symbol [Formula: see text] is added to the basic modal logic language, yielding the [Formula: see text] language; the symbols [Formula: see text] are then added to form [Formula: see text] formulas. Results about graph bisimulations are presented next, followed by a theorem, with a detailed proof, stating that [Formula: see text] formulas characterize finite directed graphs modulo bisimulation. Based on these results and the theorem, algorithms for finding subgraphs are proposed. After the query graph is divided, the match graphs are characterized using [Formula: see text] formulas. Model checking these formulas on the data graphs then captures the answer graphs that are bisimilar to the match graphs.
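To make the notion of bisimulation concrete, the following is a minimal Python sketch that computes the largest bisimulation between two small node-labeled directed graphs by repeatedly discarding pairs that violate the forth or back condition. It is a standard naive fixpoint computation, not the formula-based model-checking algorithm described in the abstract; the function name, the graph encoding, and the toy graphs are all assumptions made for illustration.

```python
from itertools import product


def largest_bisimulation(nodes_a, edges_a, labels_a, nodes_b, edges_b, labels_b):
    """Return the largest bisimulation between two node-labeled directed graphs.

    Hypothetical encoding: nodes are hashable ids, edges are (source, target)
    pairs, and labels map each node id to its label.
    """
    succ_a = {n: set() for n in nodes_a}
    for u, v in edges_a:
        succ_a[u].add(v)
    succ_b = {n: set() for n in nodes_b}
    for u, v in edges_b:
        succ_b[u].add(v)

    # Start from every pair of nodes that carry the same label.
    rel = {(a, b) for a, b in product(nodes_a, nodes_b)
           if labels_a[a] == labels_b[b]}

    changed = True
    while changed:
        changed = False
        for a, b in list(rel):
            # Forth: every successor of a is matched by some successor of b.
            forth = all(any((a2, b2) in rel for b2 in succ_b[b])
                        for a2 in succ_a[a])
            # Back: every successor of b is matched by some successor of a.
            back = all(any((a2, b2) in rel for a2 in succ_a[a])
                       for b2 in succ_b[b])
            if not (forth and back):
                rel.discard((a, b))
                changed = True
    return rel


# Toy graphs: A has two 'b' children, B has one. They are not isomorphic,
# yet every node of A is bisimilar to a node of B.
rel = largest_bisimulation(
    [1, 2, 3], [(1, 2), (1, 3)], {1: "a", 2: "b", 3: "b"},
    ["x", "y"], [("x", "y")], {"x": "a", "y": "b"},
)
```

On these toy graphs the relation pairs node 1 with x and both 2 and 3 with y, which illustrates why bisimulation is looser than isomorphism: the graphs have different numbers of nodes but the same branching behaviour.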

Approximating Graph Pattern Queries Using Views

Proceedings of the 25th ACM International on Conference on Information and Knowledge Management - CIKM '16 ◽

10.1145/2983323.2983766 ◽

2016 ◽

Cited By ~ 2

Author(s):

Jia Li ◽

Yang Cao ◽

Xudong Liu

Keyword(s):

Graph Pattern

Advantages of using graph databases to explore chromatin conformation capture experiments

BMC Bioinformatics ◽

10.1186/s12859-020-03937-0 ◽

2021 ◽

Vol 22 (S2) ◽

Author(s):

Daniele D’Agostino ◽

Pietro Liò ◽

Marco Aldinucci ◽

Ivan Merelli

Keyword(s):

Web Application ◽

High Throughput Sequencing ◽

Cell Types ◽

Graph Database ◽

Graph Databases ◽

Sources Of Information ◽

Chromosome Conformation ◽

Wide Scale ◽

User Friendly ◽

Different Cell Types

Abstract

Background: High-throughput sequencing Chromosome Conformation Capture (Hi-C) allows the study of DNA interactions and 3D chromosome folding at the genome-wide scale. Usually, these data are represented as matrices describing the binary contacts among the different chromosome regions. A graph-based representation, however, can be advantageous for describing the complex topology that DNA adopts in the nucleus of eukaryotic cells.

Methods: Here we discuss the use of a graph database for storing and analysing data obtained from Hi-C experiments. The main issue is the size of the produced data and, with a graph-based representation, the consequent need to manage adequately a large number of edges (contacts) connecting nodes (genes), which represent the sources of information. Currently available graph visualisation tools and libraries fall short for this purpose with Hi-C data. Graph databases, instead, support both the analysis and the visualisation of the spatial patterns present in Hi-C data, in particular for comparing different experiments or for efficiently re-mapping omics data in a space-aware context. In particular, describing graphs through statistical indicators and, even more, correlating them through statistical distributions makes it possible to highlight similarities and differences among Hi-C experiments in different cell conditions or different cell types.

Results: These concepts have been implemented in NeoHiC, an open-source and user-friendly web application for the progressive visualisation and analysis of Hi-C networks, based on the Neo4j graph database (version 3.5).

Conclusion: With the accumulation of more experiments, the tool will provide invaluable support for comparing the neighbours of genes across experiments and conditions, helping to highlight changes in functional domains and to identify new co-organised genomic compartments.

Application of Petersen graph pattern technique for automated detection of heart valve diseases with PCG signals

Information Sciences ◽

10.1016/j.ins.2021.01.088 ◽

2021 ◽

Vol 565 ◽

pp. 91-104

Author(s):

Turker Tuncer ◽

Sengul Dogan ◽

Ru-San Tan ◽

U. Rajendra Acharya

Keyword(s):

Heart Valve ◽

Automated Detection ◽

Petersen Graph ◽

Heart Valve Diseases ◽

Graph Pattern ◽

Valve Diseases ◽

PCG Signals

Applying graph database technology for analyzing perturbed co-expression networks in cancer

Database ◽

10.1093/database/baaa110 ◽

2020 ◽

Vol 2020 ◽

Author(s):

Claire M Simpson ◽

Florian Gnad

Keyword(s):

Relational Databases ◽

Molecular Mechanisms ◽

Biological Data ◽

Database Management System ◽

Graph Database ◽

Graph Databases ◽

Graph Representations ◽

RNAseq Data ◽

Database Technology ◽

Speed Accuracy

Abstract

Graph representations provide an elegant way to capture and analyze complex molecular mechanisms in the cell. Co-expression networks are undirected graph representations of transcriptional co-behavior indicating (co-)regulations, functional modules or even physical interactions between the corresponding gene products. The growing avalanche of available RNA sequencing (RNAseq) data fuels the construction of such networks, which are usually stored in relational databases like most other biological data. Inferring linkage by recursive multiple-join statements, however, is computationally expensive and complex to design in relational databases. In contrast, graph databases store and represent complex interconnected data as nodes, edges and properties, making it fast and intuitive to query and analyze relationships. While graph-based database technologies are moving from a fringe domain into the mainstream, only a few studies report their application to biological data. We used the graph database management system Neo4j to store and analyze co-expression networks derived from RNAseq data from The Cancer Genome Atlas. Comparing co-expression in tumors versus healthy tissues in six cancer types revealed significant perturbation tracing back to erroneous or rewired gene regulation. Applying centrality, community detection and pathfinding graph algorithms uncovered the destruction or creation of central nodes, modules and relationships in co-expression networks of tumors. Given the speed, accuracy and straightforwardness of managing these densely connected networks, we conclude that graph databases are ready to enter the arena of biological data.
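As a rough illustration of how a centrality measure can expose the "destruction or creation of central nodes" mentioned above, the following Python sketch computes normalized degree centrality on two undirected edge lists and reports the per-node change. It is not the authors' Neo4j workflow; the function names, the edge-list encoding, and the placeholder gene ids (G1 to G4) are assumptions, and the toy edges are invented purely for demonstration.

```python
from collections import defaultdict


def degree_centrality(edges):
    """Normalized degree centrality for an undirected graph given as edge pairs."""
    neighbours = defaultdict(set)
    for u, v in edges:
        if u != v:  # ignore self-loops in this sketch
            neighbours[u].add(v)
            neighbours[v].add(u)
    n = len(neighbours)
    if n < 2:
        return {node: 0.0 for node in neighbours}
    # Divide by the maximum possible number of neighbours, n - 1.
    return {node: len(nb) / (n - 1) for node, nb in neighbours.items()}


def centrality_shift(normal_edges, tumor_edges):
    """Per-node change in degree centrality (tumor minus normal)."""
    normal = degree_centrality(normal_edges)
    tumor = degree_centrality(tumor_edges)
    return {g: tumor.get(g, 0.0) - normal.get(g, 0.0)
            for g in set(normal) | set(tumor)}


# Hypothetical toy networks: G1 is a hub in the "normal" network only.
normal_edges = [("G1", "G2"), ("G1", "G3"), ("G1", "G4")]
tumor_edges = [("G1", "G2"), ("G3", "G4"), ("G2", "G3")]
shift = centrality_shift(normal_edges, tumor_edges)
```

A strongly negative shift, as for G1 in this toy case, would flag a node that lost its hub role; a positive shift would flag a newly central node.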

Understanding the graph databases and power grid systems

IOP Conference Series: Materials Science and Engineering ◽

10.1088/1757-899x/439/3/032110 ◽

2018 ◽

Vol 439 ◽

pp. 032110 ◽

Cited By ~ 1

Author(s):

Xuming Lv ◽

Shuo Chen ◽

Shanqi Zheng ◽

Jingzhao Luan ◽

Yonghui Guo

Keyword(s):

Power Grid ◽

Graph Databases ◽

Grid Systems

Efficient Access Methods for Very Large Distributed Graph Databases

Information Sciences ◽

10.1016/j.ins.2021.05.047 ◽

2021 ◽

Author(s):

David Luaces ◽

José R.R. Viqueira ◽

José M. Cotos ◽

Julián C. Flores

Keyword(s):

Graph Databases ◽

Access Methods ◽

Efficient Access

Data Profiling in Property Graph Databases

Journal of Data and Information Quality ◽

10.1145/3409473 ◽

2020 ◽

Vol 12 (4) ◽

pp. 1-27

Author(s):

Sofía Maiolo ◽

Lorena Etcheverry ◽

Adriana Marotta

Keyword(s):

Graph Databases ◽

Data Profiling

Multi-Fuzzy-Objective Graph Pattern Matching with Big Graph Data

Journal of Database Management ◽

10.4018/jdm.2019100102 ◽

2019 ◽

Vol 30 (4) ◽

pp. 24-40

Author(s):

Lei Li ◽

Fang Zhang ◽

Guanfeng Liu

Keyword(s):

Pattern Matching ◽

Particle Swarm Optimization Algorithm ◽

NSGA-II ◽

Specific Domain ◽

Graph Pattern Matching ◽

Graph Data ◽

Graph Pattern ◽

Big Graph ◽

Complex Relationships ◽

Pattern Graph

Big graph data differs from traditional data in that it usually contains complex relationships and multiple attributes. With graph pattern matching, a pattern graph can be designed to satisfy specific personal requirements and then used to locate the subgraphs that match the required pattern. How to locate, effectively and efficiently, a graph pattern with better attribute values in a big graph thus becomes a key problem in analyzing and handling big graph data, especially for a specific domain. This article introduces fuzziness into graph pattern matching. A genetic algorithm, specifically NSGA-II, and a particle swarm optimization algorithm are then adopted for multi-fuzzy-objective optimization. Experimental results show that the proposed approaches outperform the existing approaches.
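The multi-objective setting can be illustrated with Pareto dominance, the comparison on which NSGA-II's non-dominated sorting is built. The Python sketch below only extracts the non-dominated candidates from a list of objective tuples; it is not the article's algorithm, it assumes every objective is to be maximized, and the candidate scores are hypothetical values for matched subgraphs.

```python
def dominates(p, q):
    """True if p is at least as good as q on every objective and strictly
    better on at least one (all objectives are maximized in this sketch)."""
    return (all(x >= y for x, y in zip(p, q))
            and any(x > y for x, y in zip(p, q)))


def pareto_front(candidates):
    """Return the candidates that no other candidate dominates."""
    return [p for p in candidates
            if not any(dominates(q, p) for q in candidates)]


# Hypothetical (objective_1, objective_2) scores for four matched subgraphs.
candidates = [(0.9, 0.2), (0.5, 0.5), (0.4, 0.4), (0.2, 0.9)]
front = pareto_front(candidates)
```

Here (0.4, 0.4) is dominated by (0.5, 0.5) and drops out, while the other three candidates represent different trade-offs between the two objectives and all remain on the front.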
