The XML Data Mining Research Based on the Multi-Level Technology

2014 ◽  
Vol 644-650 ◽  
pp. 1875-1878
Author(s):  
Su Yu Huang ◽  
Ping Fang Hu

XML has become the standard form of data exchange, more and more data in this form for storage, implying a lot of knowledge in these data information, the need for data mining processing. For XML data mining method at present, most of the need is to pass the XML data into relational data pretreatment process, using the traditional method for processing, data mining process is complex and the effect is not ideal. Therefore, there is an urgent need some effective methods for XML data mining directly.

2013 ◽  
Vol 846-847 ◽  
pp. 1574-1577
Author(s):  
Jian Xin Zhu

The era of Web 2.0 has been coming, and more and more Web 2.0 application, such social networks and Wikipedia, have come up. As an industrial standard of the Web 2.0, the XML technique has also attracted more and more researchers. However, how to mine value information from massive XML documents is still in its infancy. In this paper, we study the basic problem of XML data mining-XML data mining model. We design a multi-level XML data mining model, propose a multi-level data mining method, and list some research issues in the implementation of XML data mining systems.


Author(s):  
Qin Ding

With the growing usage of XML data for data storage and exchange, there is an imminent need to develop efficient algorithms to perform data mining on semistructured XML data. Mining on XML data is much more difficult than mining on relational data because of the complexity of structure in XML data. A naïve approach to mining on XML data is to first convert XML data into relational format. However the structure information may be lost during the conversion. It is desired to develop efficient and effective data mining algorithms that can be directly applied on XML data.


2014 ◽  
Vol 687-691 ◽  
pp. 1466-1469
Author(s):  
Zhen Chao Wang

In the process of massive student data mining using traditional method, special words and related characteristics were used as mining objects. The concealment and feature of deliberately camouflaged of information made it is difficult for mining model to form an effective cluster centers, which reduced the accuracy of information mining. Hence an optimized data mining method was proposed. According to the degree of generalization and fuzziness of the feature words of student, the threshold of mining information was set, which avoided the effects of redundant information, thus the efficiency of mining was improved. The experimental results showed that using the improved algorithm to perform information mining in massive student database could effectively improve mining efficiency.


Sign in / Sign up

Export Citation Format

Share Document