LTRRS: A Learning to Rank Based Algorithm for Resource Selection in Distributed Information Retrieval

Opposed to centralized search where Websites are crawled and indexed, Distributed Information Retrieval (DIR), also known as Federated Search, is a powerful way to comprehensively search multiple databases in real-time simultaneously. DIR is preferred to centralized search environments in a number of ways, characteristically among them are: 1. the diversity of resources that are made available; 2. improving scalability and reducing server load and network traffic; 3. the leverage of accessing the hidden or deep Web.There are three major phases/tasks of a DIR (i) resource description or collection representation (ii) resource selection and (iii) result merging. This paper aims at providing a comprehensive review on the various phases of DIR and also some current strategies being recommended in enhancing and improving the smooth implementation of a DIR system.

Download Full-text

DISTRIBUTED INFORMATION RETRIEVAL: A MULTI-OBJECTIVE RESOURCE SELECTION APPROACH

International Journal of Uncertainty Fuzziness and Knowledge-Based Systems ◽

10.1142/s0218488503002284 ◽

2003 ◽

Vol 11 (supp01) ◽

pp. 83-99 ◽

Cited By ~ 7

Author(s):

SHENGLI WU ◽

FABIO CRESTANI

Keyword(s):

Information Retrieval ◽

Data Fusion ◽

Resource Selection ◽

Distributed Information ◽

Monetary Cost ◽

Distributed Information Retrieval ◽

Multi Objective ◽

Selection Approach ◽

Objective Model ◽

The Given

Information retrieval is becoming increasingly concerned with resource selection and data fusion for distributed archives. In distributed information retrieval, a user submits a query to a broker, which determines a solution for how to yield a given number of documents from all available resources. In this paper, we present a multi-objective model for resource selection, in which four aspects: a document's relevance to the given query, time, monetary cost, and the chance of getting document duplicates from resources, are considered simultaneously. Some variants of this multi-objective model, aimed at achieving better implementation efficiency, are also proposed.

Download Full-text

A Set-Covering-Based Approach for Overlapping Resource Selection in Distributed Information Retrieval

2009 WRI World Congress on Computer Science and Information Engineering ◽

10.1109/csie.2009.702 ◽

2009 ◽

Cited By ~ 2

Author(s):

Xiuhong Wang ◽

Shiguang Ju

Keyword(s):

Information Retrieval ◽

Resource Selection ◽

Set Covering ◽

Distributed Information ◽

Distributed Information Retrieval

Download Full-text

Result merging methods in distributed information retrieval with overlapping databases

Information Retrieval ◽

10.1007/s10791-007-9023-y ◽

2007 ◽

Vol 10 (3) ◽

pp. 297-319 ◽

Cited By ~ 10

Author(s):

Shengli Wu ◽

Sally McClean

Keyword(s):

Information Retrieval ◽

Distributed Information ◽

Distributed Information Retrieval ◽

Result Merging

Download Full-text

Distributed Information Retrieval

The Kluwer International Series in Engineering and Computer Science - Information Retrieval: Algorithms and Heuristics ◽

10.1007/978-1-4615-5539-1_7 ◽

1998 ◽

pp. 201-219 ◽

Cited By ~ 2

Author(s):

David A. Grossman ◽

Ophir Frieder

Keyword(s):

Information Retrieval ◽

Distributed Information ◽

Distributed Information Retrieval

Download Full-text

Autonomic Supervision of Stigmergic Self-Organisation for Distributed Information Retrieval

10.4108/icst.bionetics2007.2357 ◽

2007 ◽

Cited By ~ 1

Author(s):

Kieran Greer ◽

Matthias Baumgarten ◽

Maurice Mulvenna ◽

Kevin Curran ◽

Chris Nugent

Keyword(s):

Information Retrieval ◽

Distributed Information ◽

Distributed Information Retrieval ◽

Self Organisation

Download Full-text

Considering operational issues for multiagent conceptual inferencing in a distributed information retrieval application

Web Intelligence and Agent Systems: An International Journal ◽

10.3233/wia-2008-0127 ◽

2008 ◽

Vol 6 (1) ◽

pp. 1-28 ◽

Cited By ~ 1

Author(s):

Leen-Kiat Soh

Keyword(s):

Information Retrieval ◽

Distributed Information ◽

Distributed Information Retrieval ◽

Operational Issues

Download Full-text

Central-Rank-Based Collection Selection in Uncooperative Distributed Information Retrieval

Lecture Notes in Computer Science - Advances in Information Retrieval ◽

10.1007/978-3-540-71496-5_17 ◽

2007 ◽

pp. 160-172 ◽

Cited By ~ 35

Author(s):

Milad Shokouhi

Keyword(s):

Information Retrieval ◽

Distributed Information ◽

Distributed Information Retrieval ◽

Collection Selection

Download Full-text

Harvesting: Broadening the Field of Distributed Information Retrieval

Distributed Multimedia Information Retrieval - Lecture Notes in Computer Science ◽

10.1007/978-3-540-24610-7_1 ◽

2004 ◽

pp. 1-20 ◽

Cited By ~ 2

Author(s):

Edward A. Fox ◽

Marcos A. Gonçalves ◽

Ming Luo ◽

Yuxin Chen ◽

Aaron Krowne ◽

...

Keyword(s):

Information Retrieval ◽

Distributed Information ◽

Distributed Information Retrieval

Download Full-text

Improving Results Aggregation Strategies in Distributed Information Retrieval

International Journal of Engineering Research in Africa ◽

10.4028/www.scientific.net/jera.17.94 ◽

2015 ◽

Vol 17 ◽

pp. 94-104

Author(s):

Benjamin Ghansah ◽

Sheng Li Wu ◽

Nathaniel Ekow Ghansah

Keyword(s):

Information Retrieval ◽

User Satisfaction ◽

Information Needs ◽

Relevant Information ◽

General Purpose ◽

Distributed Information ◽

Distributed Information Retrieval ◽

Result Diversification ◽

Result Merging ◽

Ranked List

The top-ranked documents from various information sources that are merged together into a unified ranked list may cover the same piece of relevant information, and cannot satisfy different user needs. Result diversification(RD) solves this problem by diversifying results to cover more information needs. In recent times, RD has attracted much attention as a means of increasing user satisfaction in general purpose search engines. A myriad of approaches have been proposed in the related works for the diversification problem. However, no concrete study of search result diversification has been done in a Distributed Information Retrieval(DIR) setting. In this paper, we survey, classify and propose a theoretical framework that aims at improving diversification at the result merging phase of a DIR environment.

Download Full-text