DocBase

Author(s):  
Arijit Sengupta ◽  
Ramesh Venkataraman

This chapter introduces a complete storage and retrieval architecture for a database environment for XML documents. DocBase, a prototype system based on this architecture, uses a flexible storage and indexing technique to allow highly expressive queries without the necessity of mapping documents to other database formats. DocBase is an integration of several techniques that include (i) a formal model called Heterogeneous Nested Relations (HNR), (ii) a conceptual model XER (Extensible Entity Relationship), (ii) formal query languages (Document Algebra and Calculus), (iii) a practical query language (Document SQL or DSQL), (iv) a visual query formulation method with QBT (Query By Templates), and (v) the DocBase query processing architecture. This paper focuses on the overall architecture of DocBase including implementation details, describes the details of the query-processing framework, and presents results from various performance tests. The paper summarizes experimental and usability analyses to demonstrate its feasibility as a general architecture for native as well as embedded document manipulation methods.

2011 ◽  
Vol 22 (4) ◽  
pp. 30-56
Author(s):  
Arijit Sengupta ◽  
Ramesh Venkataraman

This article introduces a complete storage and retrieval architecture for a database environment for XML documents. DocBase, a prototype system based on this architecture, uses a flexible storage and indexing technique to allow highly expressive queries without the necessity of mapping documents to other database formats. DocBase is an integration of several techniques that include (i) a formal model called Heterogeneous Nested Relations (HNR), (ii) a conceptual model XER (Extensible Entity Relationship), (ii) formal query languages (Document Algebra and Calculus), (iii) a practical query language (Document SQL or DSQL), (iv) a visual query formulation method with QBT (Query By Templates), and (v) the DocBase query processing architecture. This paper focuses on the overall architecture of DocBase including implementation details, describes the details of the query-processing framework, and presents results from various performance tests. The paper summarizes experimental and usability analyses to demonstrate its feasibility as a general architecture for native as well as embedded document manipulation methods.


Author(s):  
Shi-Kuo Chang ◽  
Gennaro Costagliola ◽  
Erland Jungert ◽  
Karin Camara

Sensor data fusion imposes a number of novel requirements on query languages and query processing techniques. A spatial/temporal query language called SQL has been proposed to support the retrieval of multimedia information from multiple sources and databases. This chapter investigates intelligent querying techniques including fusion techniques, multimedia data transformations, interactive progressive query building and SQL query processing techniques using sensor data fusion. The authors illustrate and discuss tasks and query patterns for information fusion, provide a number of examples of iterative queries and show the effectiveness of SQL in a command-action scenario.


Author(s):  
Jingwei Cheng ◽  
Z. M. Ma ◽  
Qiang Tong

RDF plays an important role in representing Web resources in a natural and flexible way. As the amount of RDF datasets increasingly growing, storing and querying theses data have attracted the attention of more and more researchers. In this chapter, we first make a review of approaches for query processing of RDF datasets. We categorize existing methods as two classes, those making use of RDBMS to implement the storage and retrieval, and those devising their own native storage schemas. They are called Relational RDF Stores and Native Stores respectively. Secondly, we survey some important extensions of SPARQL, standard query language for RDF, which extend the expressing power of SPARQL to allow more sophisticated language constructs that meet the needs from various application scenarios.


Author(s):  
Manoranjan Parhi ◽  
B. M. Acharya ◽  
B. Puthal

The application of wireless sensor network is emerging as a new trend in different sphere of modern society. However due to the advancement of SWE, designing & discovering sensor web registry services throughout heterogeneous environments is becoming a challenging task and raises several concerns like performance, reliability, and robustness. Many approaches and frameworks have been proposed to discover the sensor web registry services. Some of the approaches assume that the requests are placed in SOAP compatible formats while others focus on GUI based parametric query processing. We have formulated an approach that uses the Natural Language Query Processing which is a convenient and easy method of data access, especially for casual users who do not understand complicated database query languages such as SQL or XML based Query Language like XQuery and XPath. SOA is the proven technology for designing an efficient Sensor Web Registry by describing various parameters and sensor web services needed. We also propose an architecture based on x-SOA that organizes the method of sensor web registry service discovery in an efficient and structured manner using an intermediary, requester friendly layer called the Request Parser & Query Generator (RPQ) between the service provider and service requester via a service registry. We describe how RPQ facilitates the processing of plain text request query to a most appropriate sensor web service and also an algorithm with implementation for a complete cycle of sensor web registry service discovery.


Author(s):  
Paulo Caetano da Silva

Analytical processing (OLAP) tools typically only deal with relational data. Hence, the analytical processing systems on XML data do not have all the functionality provided by OLAP tools to traditional data (i.e. relational). In addition, current commercial and academic OLAP tools do not process XML data that contains XLink. Therefore, there is a need to develop a solution for OLAP systems in order to assist in the strategic analysis of the organizational data represented in XML format. Aiming at overcoming this issue, this chapter proposes an analytical system composed by LMDQL (Link-Based Multidimensional Query Language), an analytical query language; XLDM (XLink Data Metamodel), a metamodel given to model cubes of XML documents with XLink and to deal with syntactic, semantic, and structural heterogeneities commonly found in XML documents; and XLPath (XLink Path Language), a navigation language for XML documents connected by XLink. As current W3C query languages for navigating in XML documents do not support XLink, XLPath is discussed in this chapter to provide features for the LMDQL query processing and a prototype system enabling OLAP queries over XML documents linked by XLink and XML schema. This prototype includes a driver, named sql2xquery, which performs the mapping of SQL queries into XQuery in a relational OLAP server. In order to validate the proposed system, a case study and its performance evaluation are presented to analyze the impact of analytical processing over XML/XLink documents.


Author(s):  
Jingwei Cheng ◽  
Z. M. Ma ◽  
Qiang Tong

RDF plays an important role in representing Web resources in a natural and flexible way. As the amount of RDF datasets increasingly growing, storing and querying theses data have attracted the attention of more and more researchers. In this chapter, we first make a review of approaches for query processing of RDF datasets. We categorize existing methods as two classes, those making use of RDBMS to implement the storage and retrieval, and those devising their own native storage schemas. They are called Relational RDF Stores and Native Stores respectively. Secondly, we survey some important extensions of SPARQL, standard query language for RDF, which extend the expressing power of SPARQL to allow more sophisticated language constructs that meet the needs from various application scenarios.


2012 ◽  
Vol 8 (1) ◽  
pp. 52-92 ◽  
Author(s):  
Paulo Caetano da Silva ◽  
Valéria Cesário Times ◽  
Ricardo Rodrigues Ciferri ◽  
Cristina Dutra de Aguiar Ciferri

Current commercial and academic OLAP tools do not process XML data that contains XLink. Aiming at overcoming this issue, this paper proposes an analytical system composed by LMDQL, an analytical query language. Also, the XLDM metamodel is given to model cubes of XML documents with XLink and to deal with syntactic, semantic and structural heterogeneities commonly found in XML documents. As current W3C query languages for navigating in XML documents do not support XLink, XLPath is discussed in this article to provide features for the LMDQL query processing. A prototype system enabling the analytical processing of XML documents that use XLink is also detailed. This prototype includes a driver, named sql2xquery, which performs the mapping of SQL queries into XQuery. To validate the proposed system, a case study and its performance evaluation are presented to analyze the impact of analytical processing over XML/XLink documents.


Sign in / Sign up

Export Citation Format

Share Document