XStruct: Efficient Schema Extraction from Multiple and Large XML Documents

Author(s):  
J. Hegewald ◽  
F. Naumann ◽  
M. Weis

2002 ◽  
Vol 9D (3) ◽  
pp. 381-388
Author(s):  
Seong-Rim Kim ◽  
Yong-Ik Yun


Author(s):  
Giovanna Guerrini ◽  
Marco Mesiti ◽  
Elisa Bertino

This chapter discusses existing approaches to evaluating and measuring structural similarity in sources of XML documents. A distinctive feature of XML documents is that information about the document structure is available in the document itself. The chapter presents approaches that evaluate structural similarity at three levels: among documents, between a document and a schema, and among schemas. The most relevant applications of such measures are document classification, schema extraction, and structural clustering of documents and schemas, though other applications, such as document change detection and structural querying, can be devised and are discussed throughout the chapter.



Author(s):  
Huiping Cao ◽  
Yan Qi ◽  
K. Selçuk Candan ◽  
Maria Luisa Sapino

Many applications require the exchange and integration of data from multiple, heterogeneous sources. eXtensible Markup Language (XML) is a standard developed to meet the data exchange needs of these applications. However, XML by itself does not address data integration requirements. This chapter discusses the challenges and techniques in XML data integration. It first presents a four-step outline of the integration of XML data, and then focuses on the first two of these steps: schema extraction and data/schema mapping. More specifically, the discussion of schema extraction presents techniques for extracting tree summaries, DTDs, or XML Schemas from XML documents, and the discussion of data/schema mapping focuses on techniques for aligning XML data and schemas.



Author(s):  
Yin Zhang ◽  
Hua Zhou ◽  
Junhui Liu ◽  
Zhihong Liang ◽  
Peng Duan


Author(s):  
Abhilash Gummadi ◽  
Jong P. Yoon ◽  
Biren Shah ◽  
Vijay Raghavan


Author(s):  
Abdelsalam M. Maatuk ◽  
Tawfig Abdelaziz ◽  
M. Akhtar Ali

