Multidimensional Data Analysis Based on Links

Author(s):  
Paulo Caetano da Silva

Analytical processing (OLAP) tools typically only deal with relational data. Hence, the analytical processing systems on XML data do not have all the functionality provided by OLAP tools to traditional data (i.e. relational). In addition, current commercial and academic OLAP tools do not process XML data that contains XLink. Therefore, there is a need to develop a solution for OLAP systems in order to assist in the strategic analysis of the organizational data represented in XML format. Aiming at overcoming this issue, this chapter proposes an analytical system composed by LMDQL (Link-Based Multidimensional Query Language), an analytical query language; XLDM (XLink Data Metamodel), a metamodel given to model cubes of XML documents with XLink and to deal with syntactic, semantic, and structural heterogeneities commonly found in XML documents; and XLPath (XLink Path Language), a navigation language for XML documents connected by XLink. As current W3C query languages for navigating in XML documents do not support XLink, XLPath is discussed in this chapter to provide features for the LMDQL query processing and a prototype system enabling OLAP queries over XML documents linked by XLink and XML schema. This prototype includes a driver, named sql2xquery, which performs the mapping of SQL queries into XQuery in a relational OLAP server. In order to validate the proposed system, a case study and its performance evaluation are presented to analyze the impact of analytical processing over XML/XLink documents.

2012 ◽  
Vol 8 (1) ◽  
pp. 52-92 ◽  
Author(s):  
Paulo Caetano da Silva ◽  
Valéria Cesário Times ◽  
Ricardo Rodrigues Ciferri ◽  
Cristina Dutra de Aguiar Ciferri

Current commercial and academic OLAP tools do not process XML data that contains XLink. Aiming at overcoming this issue, this paper proposes an analytical system composed by LMDQL, an analytical query language. Also, the XLDM metamodel is given to model cubes of XML documents with XLink and to deal with syntactic, semantic and structural heterogeneities commonly found in XML documents. As current W3C query languages for navigating in XML documents do not support XLink, XLPath is discussed in this article to provide features for the LMDQL query processing. A prototype system enabling the analytical processing of XML documents that use XLink is also detailed. This prototype includes a driver, named sql2xquery, which performs the mapping of SQL queries into XQuery. To validate the proposed system, a case study and its performance evaluation are presented to analyze the impact of analytical processing over XML/XLink documents.


Author(s):  
Arijit Sengupta ◽  
Ramesh Venkataraman

This chapter introduces a complete storage and retrieval architecture for a database environment for XML documents. DocBase, a prototype system based on this architecture, uses a flexible storage and indexing technique to allow highly expressive queries without the necessity of mapping documents to other database formats. DocBase is an integration of several techniques that include (i) a formal model called Heterogeneous Nested Relations (HNR), (ii) a conceptual model XER (Extensible Entity Relationship), (ii) formal query languages (Document Algebra and Calculus), (iii) a practical query language (Document SQL or DSQL), (iv) a visual query formulation method with QBT (Query By Templates), and (v) the DocBase query processing architecture. This paper focuses on the overall architecture of DocBase including implementation details, describes the details of the query-processing framework, and presents results from various performance tests. The paper summarizes experimental and usability analyses to demonstrate its feasibility as a general architecture for native as well as embedded document manipulation methods.


2011 ◽  
Vol 48-49 ◽  
pp. 1028-1031
Author(s):  
Ling Song ◽  
Qian Gi Lv ◽  
Xiao Bing Tang

With the continuous growth in the XML data, the ability to search in massive collections of XML data becomes important. In this paper, we present efficient techniques that are able to employ bloom-filtering to decrease computation complexity that is used to filter irrelevant XML paths. After filtering, a kind of semantic measure is used to compute similarity between the query and the relevant XML documents, which is used to rank retrieval results. Experiment results show that the retrieval prototype system based on bloom-filtering runs faster than ever under the almost same average precise.


Author(s):  
V. A. Konovalov

The paper assesses the prospects for the application of the big data paradigm in socio-economic systems through the analysis of factors that distinguish it from the well-known scientific ideas of data synthesis and decomposition. The idea of extracting knowledge directly from big data is analyzed. The article compares approaches to extracting knowledge from big data: algebraic and multidimensional data analysis used in OLAP-systems (OnLine Analytical Processing). An intermediate conclusion is made about the advisability of dividing systems for working with big data into two main classes: automatic and non-automatic. To assess the result of extracting knowledge from big data, it is proposed to use well-known scientific criteria: reliability and efficiency. It is proposed to consider two components of reliability: methodical and instrumental. The main goals of knowledge extraction in socio-economic systems are highlighted: forecasting and support for making management decisions. The factors that distinguish big data are analyzed: volume, variety, velocity, as applied to the study of socio-economic systems. The expediency of introducing a universe into systems for processing big data, which provides a description of the variety of big data and source protocols, is analyzed. The impact of the properties of sample populations from big data: incompleteness, heterogeneity, and non-representativeness, the choice of mathematical methods for processing big data is analyzed. The conclusion is made about the need for a systemic, comprehensive, cautious approach to the development of fundamental decisions of a socio-economic nature when using the big data paradigm in the study of individual socio-economic subsystems.


2011 ◽  
Vol 22 (4) ◽  
pp. 30-56
Author(s):  
Arijit Sengupta ◽  
Ramesh Venkataraman

This article introduces a complete storage and retrieval architecture for a database environment for XML documents. DocBase, a prototype system based on this architecture, uses a flexible storage and indexing technique to allow highly expressive queries without the necessity of mapping documents to other database formats. DocBase is an integration of several techniques that include (i) a formal model called Heterogeneous Nested Relations (HNR), (ii) a conceptual model XER (Extensible Entity Relationship), (ii) formal query languages (Document Algebra and Calculus), (iii) a practical query language (Document SQL or DSQL), (iv) a visual query formulation method with QBT (Query By Templates), and (v) the DocBase query processing architecture. This paper focuses on the overall architecture of DocBase including implementation details, describes the details of the query-processing framework, and presents results from various performance tests. The paper summarizes experimental and usability analyses to demonstrate its feasibility as a general architecture for native as well as embedded document manipulation methods.


2003 ◽  
pp. 1-35 ◽  
Author(s):  
Nick Bassiliades ◽  
Ioannis Vlahavas ◽  
Dimitros Sampson

In this chapter, we propose the use of first-order logic, in the form of deductive database rules, as a query language for XML data, and we present X-Device, an extension of the deductive object-oriented database system Device, for storing and querying XML data. XML documents are stored into the OODB by automatically mapping the DTD to an object schema. XML elements are treated either as classes or attributes based on their complexity, without loosing the relative order of elements in the original document. Furthermore, this chapter describes the extension of the system’s deductive rule query language with second-order variables, general path and ordering expressions, for querying over the stored, tree-structured XML data and constructing XML documents as a result. The extensions were implemented by translating all the extended features into the basic, first-order deductive rule language of Device using meta-data about stored XML objects.


2019 ◽  
Vol 10 (11) ◽  
pp. 1131-1135
Author(s):  
Tomas Hambili Paulo Sanjuluca ◽  
◽  
Ricardo Correia ◽  
Anabela Antunes de Almeida ◽  
Ana Gloria Diaz Martinez ◽  
...  

Introduction: In order to have a good assessment of the quality of maternal and child health care, it is essential that there is up-to-date and reliable information. Objective: To evaluate the impact of the implementation of a computerized database of clinical processes in the admission, archive and medical statistics section, of Maternity hospital Irene Neto/Lubango-Angola. Methodology: A descriptive study with a quantitative and qualitative approach to carry out a retrospective case study deliveries and newborns, records from 2014 to 2017. Final considerations: The implementation of this project may contribute to the improvement of clinical management support management of the hospital as well as facilitating access to information for research and scientific production.


2018 ◽  
Author(s):  
Emmanuel Owusu-Kwarteng ◽  
Prince Opoku ◽  
Gershon Dagba ◽  
Mark Amankwa

Sign in / Sign up

Export Citation Format

Share Document