Efficient XML-to-SQL Query Translation

Author(s):  
Rajasekar Krishnamurthy ◽  
Raghav Kaushik ◽  
Jeffrey F Naughton
Keyword(s):  
Author(s):  
Rajasekar Krishnamurthy ◽  
Raghav Kaushik ◽  
Jeffrey F Naughton

Semantic Web ◽  
2021 ◽  
pp. 1-34
Author(s):  
David Chaves-Fraga ◽  
Edna Ruckhaus ◽  
Freddy Priyatna ◽  
Maria-Esther Vidal ◽  
Oscar Corcho

Ontology-Based Data Access (OBDA) has traditionally focused on providing a unified view of heterogeneous datasets (e.g., relational databases, CSV and JSON files), either by materializing integrated data into RDF or by performing on-the-fly querying via SPARQL query translation. In the specific case of tabular datasets represented as several CSV or Excel files, query translation approaches have been applied by considering each source as a single table that can be loaded into a relational database management system (RDBMS). Nevertheless, constraints over these tables are not represented (e.g., referential integrity among sources, datatypes, or data integrity); thus, neither consistency among attributes nor indexes over tables are enforced. As a consequence, efficiency of the SPARQL-to-SQL translation process may be affected, as well as the completeness of the answers produced during the evaluation of the generated SQL query. Our work is focused on applying implicit constraints on the OBDA query translation process over tabular data. We propose Morph-CSV, a framework for querying tabular data that exploits information from typical OBDA inputs (e.g., mappings, queries) to enforce constraints that can be used together with any SPARQL-to-SQL OBDA engine. Morph-CSV relies on both a constraint component and a set of constraint operators. For a given set of constraints, the operators are applied to each type of constraint with the aim of enhancing query completeness and performance. We evaluate Morph-CSV in several domains: e-commerce with the BSBM benchmark; transportation with the GTFS-Madrid benchmark; and biology with a use case extracted from the Bio2RDF project. We compare and report the performance of two SPARQL-to-SQL OBDA engines, without and with the incorporation of Morph-CSV. The observed results suggest that Morph-CSV is able to speed up the total query execution time by up to two orders of magnitude, while it is able to produce all the query answers.


Author(s):  
Ibrahim Dweib ◽  
Joan Lu

This chapter presents the system architecture, and implementation tools used for evaluating the MAXDOR model. The chapter also presents the main classes created to demonstrate the methodology for mapping XML document into relational database, rebuilding XML document from relational database, updating the content of XML document stored in relational database, XPath-To-SQL query translation, and building the result in XML format. Application on a case study is also presented. XML data sets from selected XML bench marks and XML data repository will be identified to be used for testing and evaluating the model. Finally, the chapter concludes with a summary.


Sign in / Sign up

Export Citation Format

Share Document