Efficiently Coding and Indexing XML Document

XML document information retrieval model based on four-layered Bayesian network

Journal of Computer Applications ◽

10.3724/sp.j.1087.2009.02791 ◽

2009 ◽

Vol 29 (10) ◽

pp. 2791-2795

Author(s):

Xiao-long ZHANG ◽

Xing-chen HENG

Keyword(s):

Information Retrieval ◽

Bayesian Network ◽

Retrieval Model ◽

Model Based ◽

Xml Document

Download Full-text

Automatic generation of MPEG-7 compliant XML document for motion trajectory descriptor in sports video

Proceedings of the first ACM international workshop on Multimedia databases - MMDB 2003 ◽

10.1145/951676.951680 ◽

2003 ◽

Cited By ~ 1

Author(s):

Yi Haoran ◽

Deepu Rajan ◽

Chia Liang-Tien

Keyword(s):

Automatic Generation ◽

Motion Trajectory ◽

Sports Video ◽

Xml Document

Download Full-text

XML Document Transformation for Data Manipulation Operations

10.1109/ubmk52708.2021.9559019 ◽

2021 ◽

Author(s):

Aigul Mukhitova ◽

Aigerim Yerimbetova ◽

Nenad Mladenovic

Keyword(s):

Data Manipulation ◽

Xml Document ◽

Document Transformation

Download Full-text

From Capturing to XML　Document Exchange Total Solution

Joho Chishiki Gakkaishi ◽

10.2964/jsik_kj00001039689 ◽

1999 ◽

Vol 9 (3) ◽

pp. 45-48

Author(s):

Fisher Lee

Keyword(s):

Xml Document

Download Full-text

A Framework for Learning Comprehensible Theories in XML Document Classification

IEEE Transactions on Knowledge and Data Engineering ◽

10.1109/tkde.2011.158 ◽

2012 ◽

Vol 24 (1) ◽

pp. 1-14 ◽

Cited By ~ 9

Author(s):

Jemma Wu

Keyword(s):

Document Classification ◽

Xml Document

Download Full-text

Keyword Search over Probabilistic XML Documents Based on Node Classification

Mathematical Problems in Engineering ◽

10.1155/2015/210961 ◽

2015 ◽

Vol 2015 ◽

pp. 1-11 ◽

Cited By ~ 1

Author(s):

Yue Zhao ◽

Ye Yuan ◽

Guoren Wang

Keyword(s):

Keyword Search ◽

Possible World ◽

Xml Data ◽

Fast Learning ◽

Probabilistic Xml ◽

Learning Speed ◽

Xml Document ◽

Probability Threshold ◽

Node Classification ◽

Learning Machine

This paper describes a keyword search measure on probabilistic XML data based on ELM (extreme learning machine). We use this method to carry out keyword search on probabilistic XML data. A probabilistic XML document differs from a traditional XML document to realize keyword search in the consideration of possible world semantics. A probabilistic XML document can be seen as a set of nodes consisting of ordinary nodes and distributional nodes. ELM has good performance in text classification applications. As the typical semistructured data; the label of XML data possesses the function of definition itself. Label and context of the node can be seen as the text data of this node. ELM offers significant advantages such as fast learning speed, ease of implementation, and effective node classification. Set intersection can compute SLCA quickly in the node sets which is classified by using ELM. In this paper, we adopt ELM to classify nodes and compute probability. We propose two algorithms that are based on ELM and probability threshold to improve the overall performance. The experimental results verify the benefits of our methods according to various evaluation metrics.

Download Full-text

A XML Document Coding Schema Based on Complete Binary Tree Traversal

Proceedings of the 2013 International Conference on Advanced Information Engineering and Education Science (ICAIEES 2013) ◽

10.2991/icaiees-13.2013.36 ◽

2013 ◽

Author(s):

Ying Chen ◽

Liyong Wan ◽

Cheng Luo

Keyword(s):

Binary Tree ◽

Complete Binary Tree ◽

Tree Traversal ◽

Xml Document

Download Full-text

An Experimental Study on the Performance of Element-based XML Document Retrieval

Journal of the Korean Society for information Management ◽

10.3743/kosim.2006.23.1.201 ◽

2006 ◽

Vol 23 (1) ◽

pp. 201-219

Keyword(s):

Experimental Study ◽

Document Retrieval ◽

Xml Document

Download Full-text

Discovering XML Conditional Dependencies for Data Quality Issues

European Journal of Electrical Engineering and Computer Science ◽

10.24018/ejece.2020.4.1.156 ◽

2020 ◽

Vol 4 (1) ◽

Author(s):

Mohammed Ragheb Hakawati ◽

Yasmin Yacob ◽

Amiza Amir ◽

Jabiry M. Mohammed ◽

Khalid Jamal Jadaa

Keyword(s):

Data Quality ◽

Primary Standard ◽

Markup Language ◽

Document Type ◽

Data Dependencies ◽

Master Data ◽

Xml Document ◽

Extensible Markup ◽

Quality Issues ◽

Mining Algorithms

Extensible Markup Language (XML) is emerging as the primary standard for representing and exchanging data, with more than 60% of the total; XML considered the most dominant document type over the web; nevertheless, their quality is not as expected. XML integrity constraint especially XFD plays an important role in keeping the XML dataset as consistent as possible, but their ability to solve data quality issues is still intangible. The main reason is that old-fashioned data dependencies were basically introduced to maintain the consistency of the schema rather than that of the data. The purpose of this study is to introduce a method for discovering pattern tableaus for XML conditional dependencies to be used for enhancing XML document consistency as a part of data quality improvement phases. The notations of the conditional dependencies as new rules are designed mainly for improving data instance and extended traditional XML dependencies by enforcing pattern tableaus of semantically related constants. Subsequent to this, a set of minimal approximate conditional dependencies (XCFD, XCIND) is discovered and learned from the XML tree using a set of mining algorithms. The discovered patterns can be used as a Master data in order to detect inconsistencies that don’t respect the majority of the dataset.

Download Full-text