xml schema Latest Research Papers

Efficient processing of complex XSD using Hive and Spark

PeerJ Computer Science ◽

10.7717/peerj-cs.652 ◽

2021 ◽

Vol 7 ◽

pp. e652

Author(s):

Diana Martinez-Mosquera ◽

Rosa Navarrete ◽

Sergio Luján-Mora

Keyword(s):

Big Data ◽

Performance Management ◽

Mobile Networks ◽

Real Life ◽

Real Data ◽

Xml Schema ◽

Apache Spark ◽

Data Sets ◽

Apache Hive

The eXtensible Markup Language (XML) files are widely used by the industry due to their flexibility in representing numerous kinds of data. Multiple applications such as financial records, social networks, and mobile networks use complex XML schemas with nested types, contents, and/or extension bases on existing complex elements or large real-world files. A great number of these files are generated each day and this has influenced the development of Big Data tools for their parsing and reporting, such as Apache Hive and Apache Spark. For these reasons, multiple studies have proposed new techniques and evaluated the processing of XML files with Big Data systems. However, a more usual approach in such works involves the simplest XML schemas, even though, real data sets are composed of complex schemas. Therefore, to shed light on complex XML schema processing for real-life applications with Big Data tools, we present an approach that combines three techniques. This comprises three main methods for parsing XML files: cataloging, deserialization, and positional explode. For cataloging, the elements of the XML schema are mapped into root, arrays, structures, values, and attributes. Based on these elements, the deserialization and positional explode are straightforwardly implemented. To demonstrate the validity of our proposal, we develop a case study by implementing a test environment to illustrate the methods using real data sets provided from performance management of two mobile network vendors. Our main results state the validity of the proposed method for different versions of Apache Hive and Apache Spark, obtain the query execution times for Apache Hive internal and external tables and Apache Spark data frames, and compare the query performance in Apache Hive with that of Apache Spark. Another contribution made is a case study in which a novel solution is proposed for data analysis in the performance management systems of mobile networks.

DIN 4000-102:2021-05, Sachmerkmal-Listen_- Teil_102: Datenaustausch für Sachmerkmallisten mittels XML-Schema

10.31030/3236417 ◽

2021 ◽

Keyword(s):

Xml Schema

Crossref 4.4.2 XML Elements and Attributes

Terry's Archive Online ◽

10.48034/20210426 ◽

2021 ◽

Vol 2021 (04) ◽

pp. 0426

Author(s):

Terry Bollinger

Keyword(s):

Full Range ◽

Xml Schema ◽

Web Pages ◽

Small Publisher ◽

The Creation ◽

Recent Version ◽

Document Page

For anyone trying to understand both the basics and the full range of options available when making a DOI metadata submission to Crossref, this linked table of XML element and attribute descriptions gives one small publisher’s best understanding of the most recent version of Crossref’s metadata submission elements and attributes. As of April 2021, the most recent version of Crossref XML files is 4.4.2. This table provides definitions for the six Crossref XML Schema Definition (xsd) files that include the most commonly used description elements of a DOI submission: crossref4.4.2.xsd, common4.4.2.xsd, fundref.xsd, AccessIndicators.xsd, clinicaltrials.xsd, and relations.xsd. The table also includes a brief description of the main features of the externally defined jats:abstract (JATS) element. This table focuses not on XML syntax but on the intent and structure of the elements from a small publisher perspective. This table is one small publisher’s interpretation of Crossref XML and is not authoritative in any way. It will inevitably contain errors, and the author takes no responsibility for its use, which is necessarily and entirely at your own risk. Any submissions created with information from this table should be verified for correctness against the official automated documentation and tools at the Crossref submission site. Note, however, that occasional errors and inconsistencies in those Crossref XML files were uncovered during the creation of this table. Every effort has been made here both to document inconsistencies in the original files and in this interpretation of those files. Important links to Crossref documentation, including comment on the apparent status of Crossref web pages, are provided in the References section after the table on the last document page.

Appendix A: XML Schema Document for The Bee Corp’s Method of Hive Strength Assessment for Pollination Effectiveness

10.51269/mvuz7860 ◽

2021 ◽

Author(s):

Ellie Symes ◽

◽

Joseph Cazier ◽

Keyword(s):

Xml Schema ◽

Strength Assessment ◽

Pollination Effectiveness

Methods for Formalizing Cognitive Graphics and Visual Models using XML Schemas

Herald of the Bauman Moscow State Technical University Series Instrument Engineering ◽

10.18698/0236-3933-2021-1-51-77 ◽

2021 ◽

pp. 51-77

Author(s):

A.I. Vlasov ◽

L.V. Zhuravleva ◽

V.V. Kazakov

Keyword(s):

Complex Systems ◽

Data Storage ◽

Process Model ◽

Xml Schema ◽

Process Models ◽

Levels Of Abstraction ◽

Visual Model ◽

Visual Models ◽

Development Processes ◽

Cognitive Graphics

The paper analyses methods of formalising cognitive graphics and visual models using promising data storage formats. We describe the primary visual design techniques and note that they appear to be rather disconnected. We show that ensuring the coupling of data and knowledge in visual models featuring various levels of detail is the main problem in integrated usage of visual modelling tools. We analyse approaches to solving the semantic discontinuity problem, that is, provided we meet the condition under which the properties of objects, systems and processes under consideration are only input once, it is necessary to ensure that data from models corresponding to different levels of abstraction (expertise) is interconnected. One should assume that the main drawback of existing approaches to visualising complex systems is that these approaches are fragmented and isolated, which means that they will only be effective locally. The paper proposes several approaches to formalising visual models employing XML schemas, which ensures that development processes concerning visual models of various levels of abstraction are synchronised and interconnected. We use a BPMN (Business, Process, Model and Notation) visual model as an example that shows the principles of representing visual model elements by means of XML schemas. The paper provides a detailed analysis of the principles behind layer interaction in the BPMN model through flexible XML description. We show that the BPMN data structure boasts its own XML schema containing all the parameters of a class or an element. The paper presents several examples and a technique of applying an XML schema to a BPMN model, including a further generalisation aimed at formally representing the process models of complex systems

Querying multi-source heterogeneous fuzzy spatiotemporal data

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-202357 ◽

2021 ◽

pp. 1-12

Author(s):

Luyi Bai ◽

Nan Li ◽

Lishuang Liu ◽

Xuesong Hao

Keyword(s):

Integration Method ◽

State Of The Art ◽

Rapid Development ◽

Structural Heterogeneity ◽

Xml Schema ◽

Data Sources ◽

Spatiotemporal Data ◽

Relational Data ◽

Semantic Model ◽

Semantic Models

With the rapid development of the environmental, meteorological and marine data management, fuzzy spatiotemporal data has received considerable attention. Even though some achievements in querying aspect have been made, there are still some unsolved problems. Semantic and structural heterogeneity may exist among different data sources, which will lead to incomplete results. In addition, there are ambiguous query intentions and conditions when the user queries the data. This paper proposes a fuzzy spatiotemporal data semantic model. Based on this model, the RDF local semantic models are converted into a RDF global semantic model after mapping relational data and XML data to RDF local semantic models. The existing methods mainly convert relational data to RDF Schema directly. But our approach converts relational data to XML Schema and then converts it to RDF, which utilizes the semi-structured feature of XML schema to solve the structural heterogeneity between different data sources. The integration process enables us to perform global queries against different data sources. In the proposed query algorithms, the query conditions inputted are converted into exact queries before the results are returned. Finally, this paper has carried out extensive experiments, calculated the recall, precision and F-Score of the experimental results, and compared with other state-of-the-art query methods. It shows the importance of the data integration method and the effectiveness of the query method proposed in this paper.

Heuristic solution using decision tree model for enhanced XML schema matching of bridge structural calculation documents

Frontiers of Structural and Civil Engineering ◽

10.1007/s11709-020-0666-8 ◽

2021 ◽

Author(s):

Sang I. Park ◽

Sang-Ho Lee

Keyword(s):

Decision Tree ◽

Xml Schema ◽

Decision Tree Model ◽

Schema Matching ◽

Tree Model ◽

Heuristic Solution ◽

Structural Calculation

An XML Schema for the Controlling Multiple Streams for Telepresence (CLUE) Data Model

10.17487/rfc8846 ◽

2021 ◽

Author(s):

R. Presta ◽

Keyword(s):

Data Model ◽

Xml Schema ◽

Multiple Streams

Evaluation of XML Schema Support in Knowledge Management

Frontiers in Artificial Intelligence and Applications - Information Modelling and Knowledge Bases XXXII ◽

10.3233/faia200826 ◽

2020 ◽

Author(s):

Boštjan Šumak ◽

Marjan Heričko ◽

Maja Pušnik

Keyword(s):

Knowledge Management ◽

Data Quality ◽

Data Structures ◽

Xml Schema ◽

Structured Data ◽

Data Organization ◽

Management Process ◽

Efficient Management ◽

Competitive Economy

Well organized data contributes extensively to the classification possibilities and quality of Knowledge Management. XML schemas play an important role in data organization activities, and provide basic foundations for companies and organizations dealing with large amounts of data. In times where knowledge represents the greatest advantage in a competitive economy and is relatively simple to find through different web providers, the quality of internal data structures and efficient management of a company’s valuable information is of the utmost importance. XML schemas are one of the mechanisms that can provide a data organization system in a qualitative manner, and efficient knowledge management as soon as data have been defined or accumulated. A good XML schema support is a way to increase the competitiveness of an organization by ensuring structured data quality and simplifying the Knowledge Management process.

Design and Implementation of XML Schema Based Information System

2020 Chinese Automation Congress (CAC) ◽

10.1109/cac51589.2020.9326772 ◽

2020 ◽

Author(s):

Zheng Cheng ◽

Jiaju Wu ◽

Quangeng chen ◽

Yongqi Ma

Keyword(s):

Information System ◽

Xml Schema ◽

Design And Implementation

xml schema
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Efficient processing of complex XSD using Hive and Spark

DIN 4000-102:2021-05, Sachmerkmal-Listen_- Teil_102: Datenaustausch für Sachmerkmallisten mittels XML-Schema

Crossref 4.4.2 XML Elements and Attributes

Appendix A: XML Schema Document for The Bee Corp’s Method of Hive Strength Assessment for Pollination Effectiveness

Methods for Formalizing Cognitive Graphics and Visual Models using XML Schemas

Querying multi-source heterogeneous fuzzy spatiotemporal data

Heuristic solution using decision tree model for enhanced XML schema matching of bridge structural calculation documents

An XML Schema for the Controlling Multiple Streams for Telepresence (CLUE) Data Model

Evaluation of XML Schema Support in Knowledge Management

Design and Implementation of XML Schema Based Information System

Export Citation Format

xml schemaRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Efficient processing of complex XSD using Hive and Spark

DIN 4000-102:2021-05, Sachmerkmal-Listen_- Teil_102: Datenaustausch für Sachmerkmallisten mittels XML-Schema

Crossref 4.4.2 XML Elements and Attributes

Appendix A: XML Schema Document for The Bee Corp’s Method of Hive Strength Assessment for Pollination Effectiveness

Methods for Formalizing Cognitive Graphics and Visual Models using XML Schemas

Querying multi-source heterogeneous fuzzy spatiotemporal data

Heuristic solution using decision tree model for enhanced XML schema matching of bridge structural calculation documents

An XML Schema for the Controlling Multiple Streams for Telepresence (CLUE) Data Model

Evaluation of XML Schema Support in Knowledge Management

Design and Implementation of XML Schema Based Information System

xml schema
Recently Published Documents