XML Schema Evolution and Versioning

Author(s):  
Giovanna Guerrini ◽  
Marco Mesiti

The large dynamicity of XML documents on the Web has created the need to adequately support structural changes and to account for the possibility of evolving and versioning the schemas describing XML document structures. This chapter discusses and compares the support for schema evolution and versioning provided by commercial systems as well as the most relevant approaches and prototypes proposed and developed by the research community.

Author(s):  
Abad Shah ◽  
Jacob Adeniyi ◽  
Tariq Al Tuwairqi

The Web and XML have influenced all walks of lives of those who transact business over the Internet. People like to do their transactions from their homes to save time and money. For example, customers like to pay their utility bills and other banking transactions from their homes through the Internet. Most companies, including banks, maintain their records using relational database technology. But the traditional relational database technology is unable to provide all these new facilities to the customers. To make the traditional relational database technology cope with the Web and XML technologies, we need a transformation between the XML technology and the relational database technology as middleware. In this chapter, we present a new and simpler algorithm for this purpose. This algorithm transforms a schema of a XML document into a relational database schema, taking into consideration the requirement of relational database technology.


Author(s):  
Béatrice Bouchou ◽  
Denio Duarte ◽  
Mírian Halfeld Ferrari ◽  
Martin A. Musicante

The XML Messaging Protocol, a part of the Web service protocol stack, is responsible for encoding messages in a common XML format (or type), so that they can be understood at either end of a network connection. The evolution of an XML type may be required in order to reflect new communication needs, materialized by slightly different XML messages. For instance, due to a service evolution, it might be interesting to extend a type in order to allow the reception of more information, when it is available, instead of always disregarding it. The authors’ proposal consists in a conservative XML schema evolution. The framework is as follows: administrators enter updates performed on a valid XML document in order to specify new documents expected to be valid, and the system computes new types accepting both such documents and previously valid ones. Changing the type is mainly changing regular expressions that define element content models. They present the algorithm that implements this approach, its properties and experimental results.


2014 ◽  
Vol 608-609 ◽  
pp. 401-407
Author(s):  
Su Jin Cai

Data exchange can make different information systems of enterprises realize data interaction. The objective establishing it is to keep information sharing and synchronization in enterprises, which can effectively use resources, improve the performance of the system and speeds up information circulation. Data exchange model is the core of realizing data exchange and it has a set of completed data storage and access mechanism. The paper verifies the mapping of relational schema and XML schema, the conversion of database and XML data, and XML document schema, and the paper constructs XSLT documents among different XML schemas, and explains and implements the algorithms of the conversion among different XML documents.


Author(s):  
Giovanna Guerrini ◽  
Marco Mesiti ◽  
Ismael Sanz

The large amount and heterogeneity of XML documents on the Web require the development of clustering techniques to group together similar documents. Documents can be grouped together according to their content, their structure, and links inside and among documents. For instance, grouping together documents with similar structures has interesting applications in the context of information extraction, of heterogeneous data integration, of personalized content delivery, of access control definition, of web site structural analysis, of comparison of RNA secondary structures. Many approaches have been proposed for evaluating the structural and content similarity between tree-based and vector-based representations of XML documents. Link-based similarity approaches developed for Web data clustering have been adapted for XML documents. This chapter discusses and compares the most relevant similarity measures and their employment for XML document clustering.


Author(s):  
Sun-Moon Jo ◽  
◽  
Weon-Hee Yoo ◽  

XML-based access control technology aims at providing an authorization policy that can be consistently applied to various products for access control services on Internet and different kinds of environment for the products and thus providing interoperability to the existing access control products with diverse kinds of environment and types. The existing access control fails to consider information structure and semantics sufficiently due to the fundamental limitations of HTML. In addition, XML document access control supplies only action read and permits alterations of very limited value for action write. The existing access control has disadvantages that DOM tree should be loaded on memory while all XML documents are parsed to generate DOM tree; that a lot of memory is used in repetitive search for tree to authorize access to all nodes in DOM tree; and that the complex authorization evaluation process may lower system performance. In this paper, we present an authorization mechanism for secure XML sources on the Web.


2011 ◽  
Vol 1 (1) ◽  
Author(s):  
Joe Tekli ◽  
Richard Chbeir ◽  
Agma Traina ◽  
Caetano Traina

AbstractXML document comparison is becoming an ever more popular research issue due to the increasingly abundant use of XML. Likewise, a growing interest fosters the development of XML grammar matching and comparison, due to the proliferation of heterogeneous XML data sources, particularly on the Web. Nonetheless, the process of comparing XML documents with XML grammars, i.e., XML document and grammar similarity evaluation, has not yet received the attention it deserves. In this paper, we provide an overview on existing research related to XML document/grammar comparison, presenting the background and discussing the various techniques related to the problem. We also discuss some prominent application domains, ranging over document classification and clustering, document transformation, grammar evolution, selective dissemination of XML information, XML querying, as well as alert filtering in intrusion detection systems and Web Services matching and communications.


2011 ◽  
Vol 268-270 ◽  
pp. 2002-2008
Author(s):  
Li Feng Yin ◽  
Hua Jin ◽  
Hong Tian

Data constraints are the basis of the database normalization, the introduction of XML documents with incomplete information, data constraints need to be re-defined. The precise definition of the tree inferior strong multi-valued dependency for XML documents(XTISMVD) was proposed, which was based on the tree structure and subtree information equivalent. Judging condition of XTISMVD was studied. By instance this paper shows that the existence of XTISMVD produces redundancies and update anomalies, moving or adding nodes for XML Schema can reduce redundancies.


Author(s):  
JOSEPH FONG ◽  
ANTHONY FONG ◽  
H. K. WONG ◽  
PHILIP YU

With XML adopted as the technology trend on the Internet, and with investment in the current relational database systems, companies must convert their relational data into XML documents for data transmission on the Internet. In the process, to preserve the users' relational data requirements of data constraints into the converted XML documents, we must define a meaningful root element for each XML document. The construction of an XML document is based on the root element and its relevant elements. The root element can be selected from a relational entity table in the existing relational database, which depends on the requirements to present the business behind. The relevant elements are mapped from the related entities, based on the navigability of the chosen entity. The derived root and relevant elements can form a Data Type Definition Graph (DTD-graph) of an XML conceptual schema diagram which can be mapped into a Data Type Definition (DTD) of an XML schema. The result is a translated XML schema with semantic constraints transferred from a relational conceptual schema of an Extended Entity Relationship (EER) model. The data conversion from relational data to the XML documents can be done after the schema translation. The relational data are loaded into XML documents according to the translated DTD.


Author(s):  
Michal Kozielski

Information on the hierarchical nature of XML data is essential in tasks of learning from XML document structures. Within this view, XML documents can be regarded as multi-represented data, which is the case when multiple representations of the document correspond to the generation of features at each structure level of the document separately. This chapter raises the importance of using fuzzy approaches to clustering XML document structures, since these approaches are shown to be effective in combining the information coming from different document representations that correspond to different hierarchy levels. For this purpose, we overview fuzzy encoding and similarity methods and present fuzzy clustering approaches which are particularly suited for being extended to handle XML document structures. We propose two different scenarios of fuzzy clustering of XML structures, which aim to either encode the document structure hierarchy using a fuzzy bag model or to specifically handle the multi-representation of the documents.


Sign in / Sign up

Export Citation Format

Share Document