XML Benchmarking

Author(s):  
Irena Mlynkova

Since XML technologies have become a standard for data representation, numerous methods for processing XML data emerge every day. Consequently, it is necessary to compare the newly proposed methods with the existing ones, as well as analyze the effect of a particular method when applied to various types of data. In this chapter, the auhtors provide an overview of existing approaches to XML benchmarking from the perspective of various applications and show that to date the problem has been highly marginalized. Therefore, in the second part of the chapter they discuss persisting open issues and their possible solutions.

Author(s):  
Stéphane Bressan ◽  
Wee Hyong Tok ◽  
Xue Zhao

Since XML technologies have become a standard for data representation, a great amount of discussion has been generated by the persisting open issues and their possible solutions. In this chapter, the authors consider the design space for XML query processing techniques that can handle ad hoc and continuous XPath or XQuery queries over XML data streams. This chapter presents the state-of-art techniques in continuous and progressive XML query processing. They also discuss several open issues and future trends.


Author(s):  
Mary Ann Malloy ◽  
Irena Mlynkova

As XML technologies have become a standard for data representation, it is inevitable to propose and implement efficient techniques for managing XML data. A natural alternative is to exploit tools and functions offered by relational database systems. Unfortunately, this approach has many detractors, especially due to inefficiency caused by structural differences between XML data and relations. But, on the other hand, relational databases represent a mature, verified and reliable technology for managing any kind of data including XML documents. In this chapter, the authors provide an overview and classification of existing approaches to XML data management in relational databases. They view the problem from both state-of-the-practice and state-of-the-art perspectives. The authors describe the current best known solutions, their advantages and disadvantages. Finally, they discuss some open issues and their possible solutions.


Author(s):  
Barbara Catania ◽  
Elena Ferrari

Web is characterized by a huge amount of very heterogeneous data sources, that differ both in media support and format representation. In this scenario, there is the need of an integrating approach for querying heterogeneous Web documents. To this purpose, XML can play an important role since it is becoming a standard for data representation and exchange over the Web. Due to its flexibility, XML is currently being used as an interface language over the Web, by which (part of) document sources are represented and exported. Under this assumption, the problem of querying heterogeneous sources can be reduced to the problem of querying XML data sources. In this chapter, we first survey the most relevant query languages for XML data proposed both by the scientific community and by standardization committees, e.g., W3C, mainly focusing on their expressive power. Then, we investigate how typical Information Retrieval concepts, such as ranking, similarity-based search, and profile-based search, can be applied to XML query languages. Commercial products based on the considered approaches are then briefly surveyed. Finally, we conclude the chapter by providing an overview of the most promising research trends in the fields.


Author(s):  
E.K.H Salje ◽  
E Artacho ◽  
K.F Austen ◽  
R.P Bruin ◽  
M Calleja ◽  
...  

We review the work carried out within the e Minerals project to develop eScience solutions that facilitate a new generation of molecular-scale simulation work. Technological developments include integration of compute and data systems, developing of collaborative frameworks and new researcher-friendly tools for grid job submission, XML data representation, information delivery, metadata harvesting and metadata management. A number of diverse science applications will illustrate how these tools are being used for large parameter-sweep studies, an emerging type of study for which the integration of computing, data and collaboration is essential.


Author(s):  
A. Bagnasco ◽  
M. Chirico ◽  
A.M. Scapolla ◽  
E. Amodei
Keyword(s):  

2019 ◽  
Vol 18 (04) ◽  
pp. 1950048
Author(s):  
Amjad Qtaish ◽  
Mohammad T. Alshammari

Extensible Markup Language (XML) has become a common language for data interchange and data representation in the Web. The evolution of the big data environment and the large volume of data which is being represented by XML on the Web increase the challenges in effectively managing such data in terms of storing and querying. Numerous solutions have been introduced to store and query XML data, including the file systems, Object-Oriented Database (OODB), Native XML Database (NXD), and Relational Database (RDB). Previous research attempts indicate that RDB is the most powerful technology for managing XML data to date. Because of the structure variations of XML and RDB, the need to map XML data to an RDB scheme is increased. This growth has prompted numerous researchers and database vendors to propose different approaches to map XML documents to an RDB, translating different types of XPath queries to SQL queries and returning the results to an XML format. This paper aims to comprehensively review most cited and latest mapping approaches and database vendors that use RDB solution to store and query XML documents, in a narrative manner. The advantages and the drawbacks of each approach is discussed, particularly in terms of storing and querying. The paper also provides some insight into managing XML documents using RDB solution in terms of storing and querying and contributes to the XML community.


For the ability to represent data from a wide variety of sources, XML is rapidly emerging as the new standard for data representation and exchange on Web and e-government. To effectively use XML data in practice, entity resolution, which has been proven extremely useful in data fusion, inconsistency detection, and data repairing, must be in place to improve the quality of the XML data. In this chapter, the authors deal specifically with object identification on XML data, the application of which includes XML document management in highly dynamic applications like the Web and peer-to-peer systems, detection of duplicate elements in nested XML data, and finding similar identities among objects from multiple Web sources. The authors survey techniques of pairwise and groupwise entity resolution for XML data, which adopt structured information to describe the similarity or distance of XML data, like XML document and XML elements in document, and find the matching pairs which describe same object or classify them into separate groups, each group corresponding to the same object in real world. There are a lot of ways to describe the XML structure and content, such as a tree, Bayesian network, and set. The authors introduce some well-known algorithm base on these structures to solve matching XML data problems. Finally, the authors discuss directions for future research.


2010 ◽  
Vol 26 (6) ◽  
pp. 583-595 ◽  
Author(s):  
Dušan N Šormaz ◽  
Jaikumar Arumugam ◽  
Ramachandra S Harihara ◽  
Chintankumar Patel ◽  
Narender Neerukonda

2020 ◽  
Vol 8 (6) ◽  
pp. 2907-2910

This paper presents an approach to create a XML-based Syllabus repository for Computer Science subjects. It facilitates educators and experts to create, store, update and publish a syllabus for a particular subject. The objective of the research is to contribute to automate the Computer Science syllabus creation process allowing use, reuse and repurpose of the learning objects from the repository. The Syllabus learning objects like topic and subtopic are stored in a hierarchical XML structure which are combined and aggregated to create a customized syllabus for a particular subject. This paper discussing the Structuring, Navigating and Parsing of XML data done by using XML Schema Definitions (XSD), XPath and SimpleXML respective XML technologies. The steps to process the XML data and transform the data to produce the output is also discussed. We have tried to solve the issues associated with the traditional method of creating a syllabus which uses MS-Word or PDF data format.


Sign in / Sign up

Export Citation Format

Share Document