scholarly journals A Rule-based Conversion of an EER Schema to Neo4j Schema Constraints

2021 ◽  
Author(s):  
Telmo Henrique Valverde da Silva ◽  
Ronaldo dos Santos Mello

Several application domains hold highly connected data, like supply chain and social network. In this context, NoSQL graph databases raise as a promising solution since relationships are first class citizens in their data model. Nevertheless, a traditional database design methodology initially defines a conceptual schema of the domain data, and the Enhanced Entity-Relationship (EER) model is a common tool. This paper presents a rule-based conversion process from an EER schema to Neo4j schema constraints, as Neo4j is the most representative NoSQL graph database management system with an expressive data model. Different from related work, our conversion process deals with all EER model concepts and generates rules for ensuring schema constraints through a set of Cypher instructions ready to run into a Neo4j database instance, as Neo4J is a schemaless system, and it is not possible to create a schema a priori. We also present an experimental evaluation that demonstrates the viability of our process in terms of performance.

Database ◽  
2020 ◽  
Vol 2020 ◽  
Author(s):  
Claire M Simpson ◽  
Florian Gnad

Abstract Graph representations provide an elegant solution to capture and analyze complex molecular mechanisms in the cell. Co-expression networks are undirected graph representations of transcriptional co-behavior indicating (co-)regulations, functional modules or even physical interactions between the corresponding gene products. The growing avalanche of available RNA sequencing (RNAseq) data fuels the construction of such networks, which are usually stored in relational databases like most other biological data. Inferring linkage by recursive multiple-join statements, however, is computationally expensive and complex to design in relational databases. In contrast, graph databases store and represent complex interconnected data as nodes, edges and properties, making it fast and intuitive to query and analyze relationships. While graph-based database technologies are on their way from a fringe domain to going mainstream, there are only a few studies reporting their application to biological data. We used the graph database management system Neo4j to store and analyze co-expression networks derived from RNAseq data from The Cancer Genome Atlas. Comparing co-expression in tumors versus healthy tissues in six cancer types revealed significant perturbation tracing back to erroneous or rewired gene regulation. Applying centrality, community detection and pathfinding graph algorithms uncovered the destruction or creation of central nodes, modules and relationships in co-expression networks of tumors. Given the speed, accuracy and straightforwardness of managing these densely connected networks, we conclude that graph databases are ready for entering the arena of biological data.


2019 ◽  
Vol 30 (1) ◽  
pp. 41-60 ◽  
Author(s):  
Gustavo Cordeiro Galvão Van Erven ◽  
Rommel Novaes Carvalho ◽  
Waldeyr Mendes Cordeiro da Silva ◽  
Sergio Lifschitz ◽  
Harley Vera-Olivera ◽  
...  

In recent years, graph database systems have become very popular and been deployed mainly in situations where the relationship between data is significant, such as in social networks. Although they do not require a particular schema design, a data model contributes to their consistency. Designing diagrams is an approach to satisfying this demand for a conceptual data model. While researchers and companies have been developing concepts and notations for graph database modeling, their notations focus on their specific implementations. In this article, the authors propose a diagram to address this lack of a generic and comprehensive notation for graph databases modeling, named GRAPHED (Graph Description Diagram for Graph Databases). The authors verified the effectiveness and compatibility of GRAPHED in two case studies: fraud identification, and a biological network model.


2016 ◽  
Vol 64 (3) ◽  
pp. 457-466 ◽  
Author(s):  
A. Czerepicki

Abstract The article presents an innovative concept of applying graph databases in transport information systems. The model of a graph database has been presented together with implementation of data structures and search operations in a graph. The transformation concept of relational model to a graph data model has been developed. The schema of graph database has been proposed for public transport information system purposes. The realization methods have been illustrated by the use of search function based on the Cypher query language.


Relational databases are holding the maximum amount of data underpinning the web. They show excellent record of convenience and efficiency in repository, optimized query execution, scalability, security and accuracy. Recently graph databases are seen as an good replacement for relational database. When compared to the relational data model, graph data model is more vivid, strong and data expressed in it models relationships among data properly. An important requirement is to increase the vast quantities of data stored in RDB into web. In this situation, migration from relational to graph format is very advantageous. Both databases have advantages and limitations depending on the form of queries. Thus, this paper converts relational to graph database by utilizing the schema in order to develop a dual database system through migration, which merges the capability of both relational db and graph db. The experimental results are provided to demonstrate the practicability of the method and query response time over the target database. The proposed concept is proved by implementing it on MySQL and Neo4j


Author(s):  
Panji Wisnu Wirawan ◽  
Djalal Er Riyanto ◽  
Dinar Mutiara Kusumo Nugraheni ◽  
Yasmin Yasmin

Background: Semarang has broad area that cannot be covered entirely by single transportation mode. To reach a specific location, people often use more than one public transportation mode. Apart from Bus Rapid Transit, another exist namely angkot or city transportation. Multimodal traveler information is then  required to help passenger searching for a route. Several studies of multimodal traveler information system has been conducted, however the data model for multimodal transportation did not conceived in detail.Objective: Proposes a database of multimodal transportation design using graph data model by taking Semarang as a case study.Method: We create our model in oriented entity-relationship diagram (O-ERD) and map this O-ERD to the graph database schema.Result: We develop our data model in graph database schema and we implement the model using Neo4J graph database for validation purpose. Our model consist of  three graph node label namely Shelter, Angkot Stopper, and Closer Place. To validate our model, we execute a search query using the Cypher query to look for location with closer place to it.Conclusion: Our data model was successfully developed and implemented. Searching transportation route in the implementation of our model has been conducted using cypher query. It can successfully display all possible paths and routes. Our query can distinguish between one mode of transportation with another.Keywords: Graph database, Multimodal transportation, Neo4j, Cypher


Author(s):  
Kornelije Rabuzin ◽  
◽  
Sonja Ristić ◽  
Robert Kudelić ◽  
◽  
...  

In recent years, graph databases have become far more important. They have been proven to be an excellent choice for storing and managing large amounts of interconnected data. Since graph databases (GDB) rely on a graph data model based on graph theory, this study examines whether currently available graph database management systems support the principles of graph theory, and, if so, to what extent. We also show how these systems differ in terms of implementation and languages, and we also discuss which graph database management systems are used today and why.


Author(s):  
Laura Cristina Vázquez-De Los Santos ◽  
Griselda Cortes-Morales ◽  
Alicia Guadalupe Valdez-Menchaca ◽  
Diego Arnulfo Martínez-Perales

The objective of this article is to design a website for an educational institution with a dynamic data model that allows you to easily add, edit and update information. In the methodology, systems engineering concepts will be used during the system development process, documenting each stage. Carrying out the stages of requirements analysis and data model design, considering the parties involved. The Entity Relationship model was designed with the purpose of confirming the logical needs of the information. In addition, the relational model was created, where the attributes of each entity are detailed. MySQL was used as the database management system. Part of the design of the data model includes the way in which it interacts with it, for this the CRUD system is used. With the design of the data models: logical and database models, the script for the creation of the dynamic database was created, which will be used to store all the information relevant to the educational institution. As a result, the correct functionality of the database was guaranteed on the website.


2021 ◽  
Vol 22 (S2) ◽  
Author(s):  
Daniele D’Agostino ◽  
Pietro Liò ◽  
Marco Aldinucci ◽  
Ivan Merelli

Abstract Background High-throughput sequencing Chromosome Conformation Capture (Hi-C) allows the study of DNA interactions and 3D chromosome folding at the genome-wide scale. Usually, these data are represented as matrices describing the binary contacts among the different chromosome regions. On the other hand, a graph-based representation can be advantageous to describe the complex topology achieved by the DNA in the nucleus of eukaryotic cells. Methods Here we discuss the use of a graph database for storing and analysing data achieved by performing Hi-C experiments. The main issue is the size of the produced data and, working with a graph-based representation, the consequent necessity of adequately managing a large number of edges (contacts) connecting nodes (genes), which represents the sources of information. For this, currently available graph visualisation tools and libraries fall short with Hi-C data. The use of graph databases, instead, supports both the analysis and the visualisation of the spatial pattern present in Hi-C data, in particular for comparing different experiments or for re-mapping omics data in a space-aware context efficiently. In particular, the possibility of describing graphs through statistical indicators and, even more, the capability of correlating them through statistical distributions allows highlighting similarities and differences among different Hi-C experiments, in different cell conditions or different cell types. Results These concepts have been implemented in NeoHiC, an open-source and user-friendly web application for the progressive visualisation and analysis of Hi-C networks based on the use of the Neo4j graph database (version 3.5). Conclusion With the accumulation of more experiments, the tool will provide invaluable support to compare neighbours of genes across experiments and conditions, helping in highlighting changes in functional domains and identifying new co-organised genomic compartments.


2015 ◽  
Vol 27 (4) ◽  
pp. 369-388 ◽  
Author(s):  
Jang B. Singh

Purpose – The purpose of this paper was to examine changes in the contents of Canadian corporate codes of ethics over a period of two decades from an institutionalization perspective. Design/methodology/approach – The paper tracks changes in the contents of the codes of large Canadian corporations longitudinally by analyzing their contents at two points over two decades, in 1992 and 2012. In particular, the paper tests three hypotheses related to the institutionalization of codes. Findings – It was found that the codes have become more prescriptive, they are more concerned with social responsibility and are more likely to identify their moral and legal authority. Overall, the findings support an institutional interpretation of the observed changes. Research limitations/implications – While large corporations are critical in establishing new and innovative management practices, their selection as the study population limits the generalizabilty of the findings. Another limitation of this paper is that it used an a priori determined set of items to analyze the contents of the codes and while this was needed to facilitate the comparison across time, it also meant that some important items were not clearly identified. Originality/value – Codes of ethics are the foundation of ethics programs in corporations and their contents could be critical in the development of a culture of ethics in corporations. This paper makes a valuable contribution to research on business ethics by analyzing the codes of ethics of the largest corporations in Canada at two points over two decades. The need to track changes in corporate codes of ethics over time has been advocated by several researchers, but longitudinal studies in this area are rare.


Sign in / Sign up

Export Citation Format

Share Document