rdf graph Latest Research Papers

An Efficient Distributed SPARQL Query Processing Scheme Considering Communication Costs in Spark Environments

Applied Sciences ◽

10.3390/app12010122 ◽

2021 ◽

Vol 12 (1) ◽

pp. 122

Author(s):

Jongtae Lim ◽

Byounghoon Kim ◽

Hyeonbyeong Lee ◽

Dojin Choi ◽

Kyoungsoo Bok ◽

...

Keyword(s):

Query Processing ◽

Large Scale ◽

Distributed Processing ◽

Data Communication ◽

Sparql Query ◽

Query Execution ◽

Processing Scheme ◽

Communication Costs ◽

Rdf Graph ◽

Execution Path

Various distributed processing schemes were studied to efficiently utilize a large scale of RDF graph in semantic web services. This paper proposes a new distributed SPARQL query processing scheme considering communication costs in Spark environments to reduce I/O costs during SPARQL query processing. We divide a SPARQL query into several subqueries using a WHERE clause to process a query of an RDF graph stored in a distributed environment. The proposed scheme reduces data communication costs by grouping the divided subqueries in related nodes through the index and processing them, and the grouped subqueries calculate the cost of all possible query execution paths to select an efficient query execution path. The efficient query execution path is selected through the algorithm considering the data parsing cost of all possible query execution paths, amount of data communication, and queue time per node. It is shown through various performance evaluations that the proposed scheme outperforms the existing schemes.

DESIGN AND EVALUATION OF A BIM-GIS INTEGRATED INFORMATION MODEL USING RDF GRAPH DATABASE

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-viii-4-w2-2021-175-2021 ◽

2021 ◽

Vol VIII-4/W2-2021 ◽

pp. 175-182

Author(s):

A.-H. Hor ◽

G. Sohn

Keyword(s):

Performance Metrics ◽

Graph Model ◽

Information Model ◽

Query Languages ◽

Semantic Integration ◽

Graph Database ◽

Graph Databases ◽

Rdf Graph ◽

Functional Components ◽

Integrated Information

Abstract. The semantic integration modeling of BIM industry foundations classes and GIS City-geographic markup language are a milestone for many applications that involve both domains of knowledge. In this paper, we propose a system design architecture, and implementation of Extraction, Transformation and Loading (ETL) workflows of BIM and GIS model into RDF graph database model, these workflows were created from functional components and ontological frameworks supporting RDF SPARQL and graph databases Cypher query languages. This paper is about full understanding of whether RDF graph database is suitable for a BIM-GIS integrated information model, and it looks deeper into the assessment of translation workflows and evaluating performance metrics of a BIM-GIS integrated data model managed in an RDF graph database, the process requires designing and developing various pipelines of workflows with semantic tools in order to get the data and its structure into an appropriate format and demonstrate the potential of using RDF graph databases to integrate, manage and analyze information and relationships from both GIS and BIM models, the study also has introduced the concepts of Graph-Model occupancy indexes of nodes, attributes and relationships to measure queries outputs and giving insights on data richness and performance of the resulting BIM-GIS semantically integrated model.

A distributed framework to investigate the entity relatedness problem in large RDF knowledge bases

10.5753/sbbd.2021.17871 ◽

2021 ◽

Author(s):

Javier Guillot Jiménez ◽

Luiz André P. Paes Leme ◽

Yenier Torres Izquierdo ◽

Angelo Batista Neves ◽

Marco A. Casanova

Keyword(s):

Knowledge Base ◽

Search Strategy ◽

Real Data ◽

Search Space ◽

Knowledge Bases ◽

Search Strategies ◽

Rdf Graph ◽

Distributed Framework ◽

Path Search ◽

Entity Relatedness

The entity relatedness problem refers to the question of exploring a knowledge base, represented as an RDF graph, to discover and understand how two entities are connected. This question can be addressed by implementing a path search strategy, which combines an entity similarity measure, with an expansion limit, to reduce the path search space and a path ranking measure to order the relevant paths between a given pair of entities in the RDF graph. This paper first introduces DCoEPinKB, an in-memory distributed framework that addresses the entity relatedness problem. Then, it presents an evaluation of path search strategies using DCoEPinKB over real data collected from DBpedia. The results provide insights about the performance of the path search strategies.

Using the W3C Generating RDF from Tabular Data on the Web recommendation to manage small Wikidata datasets

Semantic Web ◽

10.3233/sw-210443 ◽

2021 ◽

pp. 1-23

Author(s):

Steven J. Baskauf ◽

Jessica K. Baskauf

Keyword(s):

Simple Form ◽

Data Model ◽

Graph Model ◽

Value Added ◽

Knowledge Graph ◽

Tabular Data ◽

Rdf Graph ◽

Current State ◽

Over Time ◽

The Web

The W3C Generating RDF from Tabular Data on the Web Recommendation provides a mechanism for mapping CSV-formatted data to any RDF graph model. Since the Wikibase data model used by Wikidata can be expressed as RDF, this Recommendation can be used to document tabular snapshots of parts of the Wikidata knowledge graph in a simple form that is easy for humans and applications to read. Those snapshots can be used to document how subgraphs of Wikidata have changed over time and can be compared with the current state of Wikidata using its Query Service to detect vandalism and value added through community contributions.

Reasoning about Explanations for Non-validation in SHACL

10.24963/kr.2021/2 ◽

2021 ◽

Author(s):

Shqiponja Ahmetaj ◽

Robert David ◽

Magdalena Ortiz ◽

Axel Polleres ◽

Bojken Shehu ◽

...

Keyword(s):

Decision Problems ◽

Data Complexity ◽

Input Graph ◽

Detailed Characterization ◽

Rdf Graph ◽

Set Inclusion ◽

Constraint Language ◽

Rdf Graphs ◽

The Given

The Shapes Constraint Language (SHACL) is a recently standardized language for describing and validating constraints over RDF graphs. The SHACL specification describes the so-called validation reports, which are meant to explain to the users the outcome of validating an RDF graph against a collection of constraints. Specifically, explaining the reasons why the input graph does not satisfy the constraints is challenging. In fact, the current SHACL standard leaves it open on how such explanations can be provided to the users. In this paper, inspired by works on logic-based abduction and database repairs, we study the problem of explaining non-validation of SHACL constraints. In particular, in our framework non-validation is explained using the notion of a repair, i.e., a collection of additions and deletions whose application on an input graph results in a repaired graph that does satisfy the given SHACL constraints. We define a collection of decision problems for reasoning about explanations, possibly restricting to explanations that are minimal with respect to cardinality or set inclusion. We provide a detailed characterization of the computational complexity of those reasoning tasks, including the combined and the data complexity.

Literal2Feature: An Automatic Scalable RDF Graph Feature Extractor

10.3233/ssw210036 ◽

2021 ◽

Author(s):

Farshad Bakhshandegan Moghaddam ◽

Carsten Draschner ◽

Jens Lehmann ◽

Hajira Jabeen

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Data Generation ◽

Fully Integrated ◽

Rdf Graph ◽

Big Data Technologies ◽

Rdf Data ◽

Low Dimensional ◽

Rdf Graphs

The last decades have witnessed significant advancements in terms of data generation, management, and maintenance. This has resulted in vast amounts of data becoming available in a variety of forms and formats including RDF. As RDF data is represented as a graph structure, applying machine learning algorithms to extract valuable knowledge and insights from them is not straightforward, especially when the size of the data is enormous. Although Knowledge Graph Embedding models (KGEs) convert the RDF graphs to low-dimensional vector spaces, these vectors often lack the explainability. On the contrary, in this paper, we introduce a generic, distributed, and scalable software framework that is capable of transforming large RDF data into an explainable feature matrix. This matrix can be exploited in many standard machine learning algorithms. Our approach, by exploiting semantic web and big data technologies, is able to extract a variety of existing features by deep traversing a given large RDF graph. The proposed framework is open-source, well-documented, and fully integrated into the active community project Semantic Analytics Stack (SANSA). The experiments on real-world use cases disclose that the extracted features can be successfully used in machine learning tasks like classification and clustering.

Adding Structure and Removing Duplicates in SPARQL Results with Nested Tables

10.3233/ssw210047 ◽

2021 ◽

Author(s):

Sébastien Ferré

Keyword(s):

Natural Language ◽

Sparql Query ◽

Formal Definition ◽

Graph Structure ◽

Flat Structure ◽

Rdf Graph ◽

Complex Queries ◽

Definition Of

The results of a SPARQL query are generally presented as a table with one row per result, and one column per projected variable. This is an immediate consequence of the formal definition of SPARQL results as a sequence of mappings from variables to RDF terms. However, because of the flat structure of tables, some of the RDF graph structure is lost. This often leads to duplicates in the contents of the table, and difficulties to read and interpret results. We propose to use nested tables to improve the presentation of SPARQL results. A nested table is a table where cells may contain embedded tables instead of RDF terms, and so recursively. We introduce an automated procedure that lifts flat tables into nested tables, based on an analysis of the query. We have implemented the procedure on top of Sparklis, a guided query builder in natural language, in order to further improve the readability of its UI. It can as well be implemented on any SPARQL querying interface as it only depends on the query and its flat results. We illustrate our proposal in the domain of pharmacovigilance, and evaluate it on complex queries over Wikidata.

QRDF: An efficient RDF graph processing system for fast query

Concurrency and Computation Practice and Experience ◽

10.1002/cpe.6441 ◽

2021 ◽

Author(s):

Menghan Jia ◽

Yiming Zhang ◽

Dongsheng Li

Keyword(s):

Processing System ◽

Graph Processing ◽

Rdf Graph

Load Balanced Semantic Aware Distributed RDF Graph

10.1145/3472163.3472167 ◽

2021 ◽

Author(s):

Ami Pandat ◽

Nidhi Gupta ◽

Minal Bhise

Keyword(s):

Rdf Graph ◽

Load Balanced

GSBRL : Efficient RDF graph storage based on reinforcement learning

World Wide Web ◽

10.1007/s11280-021-00919-x ◽

2021 ◽

Author(s):

Lei Zheng ◽

Ziming Shen ◽

Hongzhi Wang

Keyword(s):

Reinforcement Learning ◽

Rdf Graph

rdf graph
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

An Efficient Distributed SPARQL Query Processing Scheme Considering Communication Costs in Spark Environments

DESIGN AND EVALUATION OF A BIM-GIS INTEGRATED INFORMATION MODEL USING RDF GRAPH DATABASE

A distributed framework to investigate the entity relatedness problem in large RDF knowledge bases

Using the W3C Generating RDF from Tabular Data on the Web recommendation to manage small Wikidata datasets

Reasoning about Explanations for Non-validation in SHACL

Literal2Feature: An Automatic Scalable RDF Graph Feature Extractor

Adding Structure and Removing Duplicates in SPARQL Results with Nested Tables

QRDF: An efficient RDF graph processing system for fast query

Load Balanced Semantic Aware Distributed RDF Graph

GSBRL : Efficient RDF graph storage based on reinforcement learning

Export Citation Format

rdf graphRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

An Efficient Distributed SPARQL Query Processing Scheme Considering Communication Costs in Spark Environments

DESIGN AND EVALUATION OF A BIM-GIS INTEGRATED INFORMATION MODEL USING RDF GRAPH DATABASE

A distributed framework to investigate the entity relatedness problem in large RDF knowledge bases

Using the W3C Generating RDF from Tabular Data on the Web recommendation to manage small Wikidata datasets

Reasoning about Explanations for Non-validation in SHACL

Literal2Feature: An Automatic Scalable RDF Graph Feature Extractor

Adding Structure and Removing Duplicates in SPARQL Results with Nested Tables

QRDF: An efficient RDF graph processing system for fast query

Load Balanced Semantic Aware Distributed RDF Graph

GSBRL : Efficient RDF graph storage based on reinforcement learning

rdf graph
Recently Published Documents