Semantic Search for XML Documents

2011 ◽  
Vol 48-49 ◽  
pp. 1028-1031
Author(s):  
Ling Song ◽  
Qian Gi Lv ◽  
Xiao Bing Tang

With the continuous growth in the XML data, the ability to search in massive collections of XML data becomes important. In this paper, we present efficient techniques that are able to employ bloom-filtering to decrease computation complexity that is used to filter irrelevant XML paths. After filtering, a kind of semantic measure is used to compute similarity between the query and the relevant XML documents, which is used to rank retrieval results. Experiment results show that the retrieval prototype system based on bloom-filtering runs faster than ever under the almost same average precise.

Author(s):  
Paulo Caetano da Silva

Analytical processing (OLAP) tools typically only deal with relational data. Hence, the analytical processing systems on XML data do not have all the functionality provided by OLAP tools to traditional data (i.e. relational). In addition, current commercial and academic OLAP tools do not process XML data that contains XLink. Therefore, there is a need to develop a solution for OLAP systems in order to assist in the strategic analysis of the organizational data represented in XML format. Aiming at overcoming this issue, this chapter proposes an analytical system composed by LMDQL (Link-Based Multidimensional Query Language), an analytical query language; XLDM (XLink Data Metamodel), a metamodel given to model cubes of XML documents with XLink and to deal with syntactic, semantic, and structural heterogeneities commonly found in XML documents; and XLPath (XLink Path Language), a navigation language for XML documents connected by XLink. As current W3C query languages for navigating in XML documents do not support XLink, XLPath is discussed in this chapter to provide features for the LMDQL query processing and a prototype system enabling OLAP queries over XML documents linked by XLink and XML schema. This prototype includes a driver, named sql2xquery, which performs the mapping of SQL queries into XQuery in a relational OLAP server. In order to validate the proposed system, a case study and its performance evaluation are presented to analyze the impact of analytical processing over XML/XLink documents.


Author(s):  
Alessandro Campi

This Chapter describes a visual framework; called XQBE; that covers the most important aspects of XML data management; spanning the visualization of XML documents; the formulation of queries; the representation and specification of document schemata; the definition of integrity constraints; the formulation of updates; and the expression of reactive behaviors in response to data modifications. All these features are strongly unified by a common visual abstraction and a few recurrent paradigms; so as to provide a homogeneous and comprehensive environment that allows even users without advanced programming skills to deal with nontrivial XML data management and transformation tasks. The intrinsic ambiguity inherent in any visual representation of richly expressive languages required a considerable effort of formalization in the semantics of XQBE that eventually lead to a solution with major advantages in terms of intuitiveness. In other words; this means that the unique (and unambiguous) effect of a statement is the one the user would expect.


2019 ◽  
Vol 18 (04) ◽  
pp. 1950048
Author(s):  
Amjad Qtaish ◽  
Mohammad T. Alshammari

Extensible Markup Language (XML) has become a common language for data interchange and data representation in the Web. The evolution of the big data environment and the large volume of data which is being represented by XML on the Web increase the challenges in effectively managing such data in terms of storing and querying. Numerous solutions have been introduced to store and query XML data, including the file systems, Object-Oriented Database (OODB), Native XML Database (NXD), and Relational Database (RDB). Previous research attempts indicate that RDB is the most powerful technology for managing XML data to date. Because of the structure variations of XML and RDB, the need to map XML data to an RDB scheme is increased. This growth has prompted numerous researchers and database vendors to propose different approaches to map XML documents to an RDB, translating different types of XPath queries to SQL queries and returning the results to an XML format. This paper aims to comprehensively review most cited and latest mapping approaches and database vendors that use RDB solution to store and query XML documents, in a narrative manner. The advantages and the drawbacks of each approach is discussed, particularly in terms of storing and querying. The paper also provides some insight into managing XML documents using RDB solution in terms of storing and querying and contributes to the XML community.


Author(s):  
Pasquale De Meo ◽  
Giacomo Fiumara ◽  
Antonino Nocera ◽  
Domenico Ursino

In recent years, there has been an increase in the volume and heterogeneity of XML data sources. Moreover, these information sources are often comprised of both schemas and instances of XML data. In this context, the need of grouping similar XML documents together has led to an increasing research on clustering algorithms for XML data. In this chapter, we present an overview of the most popular methods for clustering XML data sources, distinguishing between the intensional data level and the extensional data level, depending whether the sources to cluster are DTDs and XML schemas, or XML documents; in the latter case, we focus on the structural information of the documents. We classify and describe techniques for computing similarities among XML data sources, and discuss methods for clustering DTDs/XML schemas and XML documents.


Author(s):  
Ibrahim Dweib ◽  
Joan Lu

In this chapter, the authors characterize a new model for mapping XML documents into relational database. The model examines the problem of solving the structural hole between ordered hierarchical XML and unordered tabular relational database to enable use of the relational database systems for storing, updating, and querying XML data. The authors introduce and implement a mapping system called MAXDOR to solve the problem.


2006 ◽  
Vol 22 (01) ◽  
pp. 9-14
Author(s):  
Wen-Yen Chien ◽  
Heiu-Jou Shaw ◽  
Shen-Ming Wang

In this article, Taiwan Ship Net is proposed to be a portal Web site of the Taiwan yachtindustry. By using Microsoft Distributed interNet Application (DNA) and Active ServicePage (ASP), the supply chain management system, consisting of three sections, shipyards, suppliers, and administrator, exchanges and transfers the information ofenterprise's system. Through applying XML to conduct the information exchange, weintegrate the formats of orders for several yacht manufacturers and suppliers, develop the system of placing orders online, and edit XML documents to exchange the information in the system. E-catalogs that are easy to preserve and update will replace the traditional paper catalogs hereafter. The various demands of accessories will result in an increase in the costs of production and logistics. Through this system, yacht manufacturers can search the material of E-catalogs and purchase online directly. Meanwhile, it will allow the suppliers to update the E-catalogs regularly to be more competitive in the market. Taiwan Ship Net can provide the newest information to foreign customers and communicate with all yacht companies, equipment suppliers, and manufacturers in Taiwan.


2014 ◽  
Vol 56 (6) ◽  
Author(s):  
Juraj Somorovsky

AbstractXML Encryption and XML Signature describe how to apply encryption and signing algorithms to XML documents. These specifications are implemented in a wide range of systems and frameworks processing sensitive data, including banking, eGovernment, eCommerce, military, and eHealth infrastructures. The article presents practical and highly critical attacks which allow to forge signed XML documents or reveal contents of encrypted XML data.


Author(s):  
Angela Bonifati ◽  
Alfredo Cuzzocrea

This chapter presents XP2P (XPath for P2P), a framework for fragmenting and managing XML data over structured peer-to-peer networks. XP2P is characterized by an innovative mechanism for fragmenting XML documents based on meaningful XPath queries, and novel fingerprinting techniques for indexing and looking-up distributed fragments based on Chord’s DHT. Efficient algorithms for querying distributed fragments over peer-to-peer networks are also presented and experimentally assessed against both synthetic and real XML data sets. A comprehensive analysis of future research directions on XML data management over peer-to-peer networks completes the contribution of the chapter.


Author(s):  
Mohammed Adam Ibrahim Fakharaldien ◽  
Jasni Mohamed Zain ◽  
Norrozila Sulaiman ◽  
Tutut Herawan

Storing XML documents in a relational database is a promising solution because relational databases are mature and scale very well. They have the advantages that in a relational database XML data and structured data can coexist making it possible to build application that involve both kinds of data with little extra effort. This paper proposes an alternative method named Xrecursive for mapping XML (eXtensible Markup Language) documents to RDB (Relational Databases). The Xrecursive method does not need a DTD (Document Text Definition) or XML schema. Further, it can be applied as a general solution for any XML data. The steps and algorithm of Xrecursive are given in details to describe how to use the storing structure to storage and query XML documents in relational database. The authors report their experimental results on a real database, showing that the performance of their Xrecursive algorithm achieves better results in terms of storage size, insertion time, mapping time, and reconstruction time as compared with that SUCXENT and XParent methods. In overall, Xrecursive performs better in term of query performances as compared to the both methods.


Sign in / Sign up

Export Citation Format

Share Document