Two-Phase Path Retrieval Method for Similar XML Document Retrieval

Author(s):  
Jae-Min Lee ◽  
Byung-Yeon Hwang
2014 ◽  
Vol 981 ◽  
pp. 167-170
Author(s):  
Jiang Xin Chen ◽  
Li Yong Wan

XML is a new standard for data representation and exchange, which has been widely used on the Internet. XML retrieval has caused more and more researchers concern. How to improve the retrieval efficiency has become an important study field, this paper proposes the retrieval method based on keyword weight and structure extension, which can get the middle retrieval results in accordance with calculating the keyword weight, and further to retrieval by implementing structural extension of middle retrieval results, then can get the final query results. The experiment results show that the retrieval method has high accuracy and the recall rate compared with the traditional retrieval method.


2012 ◽  
Vol 605-607 ◽  
pp. 2561-2568
Author(s):  
Qin Wang ◽  
Shou Ning Qu ◽  
Tao Du ◽  
Ming Jing Zhang

Nowadays, document retrieval was an important way of academic exchange and achieving new knowledge. Choosing corresponding category of database and matching the input key words was the traditional document retrieval method. Using the method, a mass of documents would be got and it was hard for users to find the most relevant document. The paper put forward text quantification method. That was mining the features of each element in some document, which including word concept, weight value for position function, improved weights characteristic value, text distribution function weights value and text element length. Then the word’ contributions to this document would be got from the combination of five elements characteristics. Every document in database was stored digitally by the contribution of elements. And a subject mapping scheme was designed in the paper, which the similarity calculation method based on contribution and association rule was firstly designed, according to the method, the documents in the database would be conducted text clustering, and then feature extraction method was used to find class subject. When searching some document, the description which users input would be quantified and mapped to some class automatically by subject mapping, then the document sequences would be retrieved by computing the similarity between the description and the other documents’ features in the class. Experiment shows that the scheme has many merits such as intelligence, accuracy as well as improving retrieval speed.


2005 ◽  
Vol 23 (3) ◽  
pp. 267-298 ◽  
Author(s):  
Laurence A. F. Park ◽  
Kotagiri Ramamohanarao ◽  
Marimuthu Palaniswami

1989 ◽  
Vol 30 (2) ◽  
pp. 103-120 ◽  
Author(s):  
Tetsuya Murai ◽  
Masaaki Miyakoshi ◽  
Masaru Shimbo

Sign in / Sign up

Export Citation Format

Share Document