Efficient Evaluation of Partial Match Queries for XML Documents Using Information Retrieval Techniques

GuessXQ

Innovations in XML Applications and Metadata Management ◽

10.4018/978-1-4666-2669-0.ch004 ◽

2013 ◽

pp. 57-76

Author(s):

Daniela Morais Fonte ◽

Daniela da Cruz ◽

Pedro Rangel Henriques ◽

Alda Lopes Gancarski

Keyword(s):

Information Retrieval ◽

Learning Process ◽

Web Application ◽

Relational Databases ◽

Query Language ◽

General Purpose ◽

Considerable Effort ◽

Document Structure ◽

Query By Example ◽

Xml Documents

XML is a widely used general-purpose annotation formalism for creating custom markup languages. XML annotations give structure to plain documents to interpret their content. To extract information from XML documents XPath and XQuery languages can be used. However, the learning of these dialects requires a considerable effort. In this context, the traditional Query-By-Example methodology (for Relational Databases) can be an important contribution to leverage this learning process, freeing the user from knowing the specific query language details or even the document structure. This chapter describes how to apply the Query-By-Example concept in a Web-application for information retrieval from XML documents, the GuessXQ system. This engine is capable of deducing, from an example, the respective XQuery statement. The example consists of marking the desired components directly on a sample document, picked-up from a collection. After inferring the corresponding query, GuessXQ applies it to the collection to obtain the desired result.

Download Full-text

Technologies for Information Access and Knowledge Management

Encyclopedia of Information Science and Technology, Second Edition ◽

10.4018/978-1-60566-026-4.ch587 ◽

2011 ◽

pp. 3680-3685

Author(s):

Thomas Mandl

Keyword(s):

Information Retrieval ◽

Search Engines ◽

Information Access ◽

Partial Match ◽

Retrieval Models ◽

Automatic Indexing ◽

Patent Retrieval ◽

Information Retrieval Systems ◽

Information Work ◽

The 1960S

In the 1960s, automatic indexing methods for texts were developed. They had already implemented the “bag-ofwords” approach, which still prevails. Although automatic indexing is widely used today, many information providers and even Internet services still rely on human information work. In the 1970s, research shifted its interest to partial-match retrieval models and proved their superiority over Boolean retrieval models. Vector-space and later probabilistic retrieval models were developed. However, it took until the 1990s for partial-match models to succeed in the market. The Internet played a great role in this success. All Web search engines were based on partial-match models and provided ranked lists as results rather than unordered sets of documents. Consumers got used to this kind of search systems, and all big search engines included partial-match functionality. However, there are many niches in which Boolean methods still dominate, for example, patent retrieval. The basis for information retrieval systems may be pictures, graphics, videos, music objects, structured documents, or combinations thereof. This article is mainly concerned with information retrieval for text documents.

Download Full-text

Information Retrieval System for XML Documents

Lecture Notes in Computer Science - Database and Expert Systems Applications ◽

10.1007/3-540-46146-9_75 ◽

2002 ◽

pp. 758-767 ◽

Cited By ~ 4

Author(s):

Kenji Hatano ◽

Hiroko Kinutani ◽

Masatoshi Yoshikawa ◽

Shunsuke Uemura

Keyword(s):

Information Retrieval ◽

Retrieval System ◽

Information Retrieval System ◽

Xml Documents

Download Full-text

Efficient evaluation of linear path expressions on large-scale heterogeneous XML documents using information retrieval techniques

Journal of Systems and Software ◽

10.1016/j.jss.2005.05.009 ◽

2006 ◽

Vol 79 (2) ◽

pp. 180-190 ◽

Cited By ~ 6

Author(s):

Young-Ho Park ◽

Kyu-Young Whang ◽

Byung Suk Lee ◽

Wook-Shin Han

Keyword(s):

Information Retrieval ◽

Large Scale ◽

Linear Path ◽

Xml Documents

Download Full-text

System of Information Retrieval in XML Documents

Effective Databases for Text & Document Management ◽

10.4018/978-1-93177-747-6.ch001 ◽

2011 ◽

pp. 1-11 ◽

Cited By ~ 2

Author(s):

Saliha Smadhi

Keyword(s):

Information Retrieval ◽

Xml Documents

Download Full-text

Flexible Information Retrieval on XML Documents

Lecture Notes in Computer Science - Intelligent Search on XML Data ◽

10.1007/978-3-540-45194-5_6 ◽

2003 ◽

pp. 95-106 ◽

Cited By ~ 8

Author(s):

Torsten Grabs ◽

Hans-Jörg Schek

Keyword(s):

Information Retrieval ◽

Xml Documents

Download Full-text

Structured information retrieval in XML documents

Proceedings of the 2002 ACM symposium on Applied computing - SAC '02 ◽

10.1145/508791.508919 ◽

2002 ◽

Cited By ~ 25

Author(s):

Evangelos Kotsakis

Keyword(s):

Information Retrieval ◽

Xml Documents ◽

Structured Information Retrieval ◽

Structured Information

Download Full-text

Algebraic Modeling of Information Retrieval in XML Documents

10.1063/1.3271632 ◽

2009 ◽

Author(s):

Bozhidar Georgiev ◽

Adriana Georgieva ◽

George Venkov ◽

Ralitza Kovacheva ◽

Vesela Pasheva

Keyword(s):

Information Retrieval ◽

Xml Documents

Download Full-text

Interactive information retrieval from XML documents represented by attribute grammars

Proceedings of the 2003 ACM symposium on Document engineering - DocEng '03 ◽

10.1145/958220.958251 ◽

2003 ◽

Cited By ~ 2

Author(s):

Alda Lopes Gançarski ◽

Pedro Rangel Henriques

Keyword(s):

Information Retrieval ◽

Attribute Grammars ◽

Interactive Information Retrieval ◽

Xml Documents

Download Full-text

Distributed processing of queries for XML documents in an agent based information retrieval system

Proceedings 2000 Kyoto International Conference on Digital Libraries: Research and Practice ◽

10.1109/dlrp.2000.942181 ◽

2002 ◽

Cited By ~ 2

Author(s):

B. Czejdo ◽

R. Miller ◽

M. Taylor ◽

M. Rusinkiewicz

Keyword(s):

Information Retrieval ◽

Retrieval System ◽

Distributed Processing ◽

Information Retrieval System ◽

Agent Based ◽

Xml Documents

Download Full-text