Two-Phase Path Retrieval Method for Similar XML Document Retrieval

Lecture Notes in Computer Science - Knowledge-Based Intelligent Information and Engineering Systems ◽

10.1007/11552413_138 ◽

2005 ◽

pp. 967-971 ◽

Cited By ~ 1

Author(s):

Jae-Min Lee ◽

Byung-Yeon Hwang

Keyword(s):

Document Retrieval ◽

Two Phase ◽

Retrieval Method ◽

Xml Document ◽

Phase Path

Download Full-text

A Novel XML Document Retrieval Method

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.981.167 ◽

2014 ◽

Vol 981 ◽

pp. 167-170

Author(s):

Jiang Xin Chen ◽

Li Yong Wan

Keyword(s):

Document Retrieval ◽

Recall Rate ◽

Data Representation ◽

High Accuracy ◽

Xml Retrieval ◽

Retrieval Method ◽

Xml Document ◽

Retrieval Efficiency ◽

Important Study ◽

Structure Extension

XML is a new standard for data representation and exchange, which has been widely used on the Internet. XML retrieval has caused more and more researchers concern. How to improve the retrieval efficiency has become an important study field, this paper proposes the retrieval method based on keyword weight and structure extension, which can get the middle retrieval results in accordance with calculating the keyword weight, and further to retrieval by implementing structural extension of middle retrieval results, then can get the final query results. The experiment results show that the retrieval method has high accuracy and the recall rate compared with the traditional retrieval method.

Download Full-text

The Research and Application in Intelligent Document Retrieval Based on Text Quantification and Subject Mapping

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.605-607.2561 ◽

2012 ◽

Vol 605-607 ◽

pp. 2561-2568

Author(s):

Qin Wang ◽

Shou Ning Qu ◽

Tao Du ◽

Ming Jing Zhang

Keyword(s):

Association Rule ◽

Document Retrieval ◽

Retrieval Method ◽

Feature Extraction Method ◽

Quantification Method ◽

Academic Exchange ◽

Similarity Calculation ◽

Five Elements ◽

Mapping Scheme ◽

Text Element

Nowadays, document retrieval was an important way of academic exchange and achieving new knowledge. Choosing corresponding category of database and matching the input key words was the traditional document retrieval method. Using the method, a mass of documents would be got and it was hard for users to find the most relevant document. The paper put forward text quantification method. That was mining the features of each element in some document, which including word concept, weight value for position function, improved weights characteristic value, text distribution function weights value and text element length. Then the word’ contributions to this document would be got from the combination of five elements characteristics. Every document in database was stored digitally by the contribution of elements. And a subject mapping scheme was designed in the paper, which the similarity calculation method based on contribution and association rule was firstly designed, according to the method, the documents in the database would be conducted text clustering, and then feature extraction method was used to find class subject. When searching some document, the description which users input would be quantified and mapped to some class automatically by subject mapping, then the document sequences would be retrieved by computing the similarity between the description and the other documents’ features in the class. Experiment shows that the scheme has many merits such as intelligence, accuracy as well as improving retrieval speed.

Download Full-text