FUZZY DATA REPRESENTATION AND QUERYING IN XML DATABASE

Ever since the Extensible Markup Language (XML) (W3C, 1998b) began to be used to exchange data between diverse sources, interest has grown in deploying data management technology to store and query XML documents. A number of approaches propose to adapt relational database technology to store and maintain XML documents (Deutsch, Fernandez & Suciu, 1999; Florescu & Kossmann, 1999; Klettke & Meyer, 2000; Shanmugasundaram et al., 1999; Tatarinov et al., 2002; O’Neil et al., 2004). The advantage is that the XML repository inherits all the power of mature relational technology like indexes and transaction management. For XML-enabled querying, a declarative query language (Chamberlin et al., 2001) is available.

Download Full-text

XML Schema Integration and E-Commerce

Electronic Commerce ◽

10.4018/978-1-59904-943-4.ch026 ◽

2011 ◽

pp. 286-291

Author(s):

Kalpdrum Passi ◽

Louise Lane ◽

Sanjay Madria ◽

Mukesh Mohania

Keyword(s):

Incomplete Data ◽

Xml Schema ◽

Structured Data ◽

Markup Language ◽

Schema Integration ◽

Semantic Meaning ◽

Data Set ◽

Xml Documents ◽

Xml Document ◽

Extensible Markup

XML (eXtensible Markup Language) is used to describe semi-structured data, i.e., irregular or incomplete data whose structure may be subject to unpredictable changes. Unlike traditional semi-structured data, XML documents are self-describing, thus XML provides a platform-independent means to describe data and, therefore, can transport data from one platform to another (Bray, Paoli, & Sperberg-McQueen, 1998). XML documents can be both created and used by applications. The valid content, allowed structure, and metadata properties of XML documents are described by their related schema(s) (Thompson, Beech, Maloney, & Mendelsohn, 2001). An XML document is said to be valid if it conforms to its related schema. A schema also gives additional semantic meaning to the data it is used to tag. The schema is provided independently of the data it describes. Any given data set may rely on multiple schemas for validation. Any given schema may itself refer to multiple schemas.

Download Full-text

XML Schema Integration and E-Commerce

Encyclopedia of Information Science and Technology, First Edition ◽

10.4018/978-1-59140-553-5.ch555 ◽

2005 ◽

pp. 3118-3121

Author(s):

Kalpdrum Passi ◽

Louise Lane ◽

Sanjay Madria ◽

Mukesh Mohania

Keyword(s):

Incomplete Data ◽

Xml Schema ◽

Structured Data ◽

Markup Language ◽

Schema Integration ◽

Semantic Meaning ◽

Data Set ◽

Xml Documents ◽

Xml Document ◽

Extensible Markup

XML (eXtensible Markup Language) is used to describe semi-structured data, i.e., irregular or incomplete data whose structure may be subject to unpredictable changes. Unlike traditional semi-structured data, XML documents are self-describing, thus XML provides a platform-independent means to describe data and, therefore, can transport data from one platform to another (Bray, Paoli, & Sperberg-McQueen, 1998). XML documents can be both created and used by applications. The valid content, allowed structure, and metadata properties of XML documents are described by their related schema(s) (Thompson, Beech, Maloney, & Mendelsohn, 2001). An XML document is said to be valid if it conforms to its related schema. A schema also gives additional semantic meaning to the data it is used to tag. The schema is provided independently of the data it describes. Any given data set may rely on multiple schemas for validation. Any given schema may itself refer to multiple schemas.

Download Full-text

Book Review: Coding with XML for Efficiencies in Cataloging and Metadata

Library Resources and Technical Services ◽

10.5860/lrts.63n1.76 ◽

2019 ◽

Vol 63 (1) ◽

pp. 76

Author(s):

Lisa Lorenzo

Keyword(s):

Xml Schema ◽

Annual Conference ◽

Markup Language ◽

Bibliographic Data ◽

Extensible Markup ◽

Hypertext Markup Language ◽

Exhaustive List ◽

Comprehensive Reference

Written as a follow-up to an Association for Library Collections & Technical Services (ALCTS) pre-conference held during the 2015 American Library Association (ALA) Annual Conference, Coding with XML for Efficiencies in Cataloging and Metadata is an excellent introduction to the potential of Extensible Markup Language (XML) and related technologies in creating efficiencies in library cataloging and metadata work. As stated in the volume’s introduction, this guide will be most useful to those with some familiarity with XML or Hypertext Markup Language (HTML). However, this is not a requirement as the first chapter introduces XML in a way that will bring most newcomers up to speed. Catalogers will likely benefit most from this volume given that the majority of examples involve MARC 21 bibliographic data. However, metadata managers more broadly will also find value here, particularly in sections on XML Schema Definition Language (XSD), Extensible Stylesheet Language for Transformation (XSLT), XPath, and XQuery. Coding with XML does not purport to be a comprehensive reference for all XML work in libraries, but more of “a tutorial on its subject” (3). This is certainly true—each section provides an approachable and thorough introduction to a particular technology rather than an exhaustive list of features. There is also a useful list of resources at the end of the book for readers who want more information and more in-depth examples.

Download Full-text

A Narrative Review of Storing and Querying XML Documents Using Relational Database

Journal of Information & Knowledge Management ◽

10.1142/s0219649219500485 ◽

2019 ◽

Vol 18 (04) ◽

pp. 1950048

Author(s):

Amjad Qtaish ◽

Mohammad T. Alshammari

Keyword(s):

Relational Database ◽

File Systems ◽

Data Representation ◽

Xml Data ◽

Xml Documents ◽

Data Interchange ◽

Extensible Markup ◽

Data Environment ◽

Object Oriented Database ◽

The Web

Extensible Markup Language (XML) has become a common language for data interchange and data representation in the Web. The evolution of the big data environment and the large volume of data which is being represented by XML on the Web increase the challenges in effectively managing such data in terms of storing and querying. Numerous solutions have been introduced to store and query XML data, including the file systems, Object-Oriented Database (OODB), Native XML Database (NXD), and Relational Database (RDB). Previous research attempts indicate that RDB is the most powerful technology for managing XML data to date. Because of the structure variations of XML and RDB, the need to map XML data to an RDB scheme is increased. This growth has prompted numerous researchers and database vendors to propose different approaches to map XML documents to an RDB, translating different types of XPath queries to SQL queries and returning the results to an XML format. This paper aims to comprehensively review most cited and latest mapping approaches and database vendors that use RDB solution to store and query XML documents, in a narrative manner. The advantages and the drawbacks of each approach is discussed, particularly in terms of storing and querying. The paper also provides some insight into managing XML documents using RDB solution in terms of storing and querying and contributes to the XML community.

Download Full-text

On an Enhancement of XML Applied for Mobile E-Commerce

Journal of Electronic Commerce in Organizations ◽

10.4018/jeco.2012070102 ◽

2012 ◽

Vol 10 (3) ◽

pp. 13-26

Author(s):

Xiaomin Zhu ◽

Zhongxiang He ◽

Shengbo Shi

Keyword(s):

Data Processing ◽

Mobile Computing ◽

Web Service ◽

Size Effects ◽

Processing Time ◽

The Internet ◽

Markup Language ◽

Mobile Web ◽

Xml Documents ◽

Extensible Markup

Extensible Markup Language (XML) is a textual markup language which becomes more and more important in the Internet web service. However, some distinct disadvantages exist in XML, such as its nature of redundancy, which consumes the limited network’s bandwidth greatly especially in mobile computing. Considering the characteristics of the mobile commerce, the handsets’ memory capability and data processing time are two problems for XML being applied. This paper studies an enhancement of XML for the purpose of application in mobile e-commerce, called SXML, which means Simple XML to enhance the XML used in mobile web service. It helps XML producers minimizing the size effects of XML, e.g., the size overhead and slow implementation speed. Comprehensive simulations show that the SXML could reduce the size of XML documents and reduce the time of implementation, consequently utilize the bandwidth effectively.

Download Full-text

Mining Association Rules from XML Documents

Enterprise Information Systems ◽

10.4018/978-1-61692-852-0.ch321 ◽

2011 ◽

pp. 879-899

Author(s):

Laura Irina Rusu ◽

Wenny Rahayu ◽

David Taniar

Keyword(s):

Knowledge Discovery ◽

Association Rules ◽

Web Application ◽

Semistructured Data ◽

Markup Language ◽

Xml Documents ◽

Rapid Changes ◽

Extensible Markup ◽

Hidden Knowledge ◽

The Web

This chapter presents some of the existing mining techniques for extracting association rules out of XML documents in the context of rapid changes in the Web knowledge discovery area. The initiative of this study was driven by the fast emergence of XML (eXtensible Markup Language) as a standard language for representing semistructured data and as a new standard of exchanging information between different applications. The data exchanged as XML documents become richer and richer every day, so the necessity to not only store these large volumes of XML data for later use, but to mine them as well to discover interesting information has became obvious. The hidden knowledge can be used in various ways, for example, to decide on a business issue or to make predictions about future e-customer behaviour in a Web application. One type of knowledge that can be discovered in a collection of XML documents relates to association rules between parts of the document, and this chapter presents some of the top techniques for extracting them.

Download Full-text

Querying XML documents in logic programming

Theory and Practice of Logic Programming ◽

10.1017/s1471068407003183 ◽

2008 ◽

Vol 8 (3) ◽

pp. 323-361 ◽

Cited By ~ 12

Author(s):

J. M. ALMENDROS-JIMÉNEZ ◽

A. BECERRA-TERÓN ◽

F. J. ENCISO-BAÑOS

Keyword(s):

Logic Programming ◽

Large Scale ◽

Query Language ◽

Electronic Publishing ◽

Logic Program ◽

Main Memory ◽

Secondary Memory ◽

Xml Documents ◽

Xml Document ◽

Extensible Markup

AbstractExtensible Markup Language (XML) is a simple, very flexible text format derived from SGML. Originally designed to meet the challenges of large-scale electronic publishing, XML is also playing an increasingly important role in the exchange of a wide variety of data on the Web and elsewhere. XPath language is the result of an effort to provide address parts of an XML document. In support of this primary purpose, it becomes in a query language against an XML document. In this paper we present a proposal for the implementation of the XPath language in logic programming. With this aim we will describe the representation of XML documents by means of a logic program. Rules and facts can be used for representing the document schema and the XML document itself. In particular, we will present how to index XML documents in logic programs: rules are supposed to be stored in main memory, however facts are stored in secondary memory by using two kind of indexes: one for each XML tag, and other for each group of terminal items. In addition, we will study how to query by means of the XPath language against a logic program representing an XML document. It evolves the specialization of the logic program with regard to the XPath expression. Finally, we will also explain how to combine the indexing and the top-down evaluation of the logic program.

Download Full-text

An Approach to Extracting Knowledge From Legacy Documents

Volume 4: 24th Computers and Information in Engineering Conference ◽

10.1115/detc2004-57677 ◽

2004 ◽

Author(s):

Richard Crowder ◽

Yee-Wie Sim

Keyword(s):

Engineering Design ◽

Human Resource ◽

Extensible Markup Language ◽

Markup Language ◽

Design Environment ◽

Text Search ◽

Xml Documents ◽

Extensible Markup ◽

Access To Data ◽

Efficiency And Reliability

Organisations are increasingly information intensive; hence providing access to data that is trapped in various proprietary forms including catalogues, databases, human resource systems and internally generated documents is now becoming a significant and challenging task. The authors have undertaken research into approaches to capture relevant knowledge from legacy documents. This is achieved by converting the legacy documents to XML, (eXtensible Markup Language), documents where the output is semantically tagged. Once in an XML form, the data can be easily transformed. This paper describes the development of tools to automate the process of converting legacy documents to XML documents. The purpose of this work is improve the efficiency and reliability of Expertise Finder suitable for use within an engineering design environment. We will also show that by querying the resultant XML versions of legacy documents provides better results than a basic text search over the identical documents when applied used within an Expertise Finder.

Download Full-text

Transforming data-centric eXtensible markup language into relational databases using hybrid approach

Bulletin of Electrical Engineering and Informatics ◽

10.11591/eei.v10i6.2865 ◽

2021 ◽

Vol 10 (6) ◽

pp. 3256-3264

Author(s):

Su-Cheng Haw ◽

Emyliana Song

Keyword(s):

Relational Databases ◽

Hybrid Approach ◽

Data Representation ◽

Extensible Markup Language ◽

Markup Language ◽

Seamless Integration ◽

Structural Relationships ◽

Extensible Markup ◽

Core Framework ◽

Labelling Scheme

eXtensible markup language (XML) appeared internationally as the format for data representation over the web. Yet, most organizations are still utilising relational databases as their database solutions. As such, it is crucial to provide seamless integration via effective transformation between these database infrastructures. In this paper, we propose XML-REG to bridge these two technologies based on node-based and path-based approaches. The node-based approach is good to annotate each positional node uniquely, while the path-based approach provides summarised path information to join the nodes. On top of that, a new range labelling is also proposed to annotate nodes uniquely by ensuring the structural relationships are maintained between nodes. If a new node is to be added to the document, re-labelling is not required as the new label will be assigned to the node via the new proposed labelling scheme. Experimental evaluations indicated that the performance of XML-REG exceeded XMap, XRecursive, XAncestor and Mini-XML concerning storing time, query retrieval time and scalability. This research produces a core framework for XML to relational databases (RDB) mapping, which could be adopted in various industries.

Download Full-text