A bias–variance evaluation framework for information retrieval systems

The absence of reliable and efficient techniques to evaluate information retrieval systems has become a bottleneck in the development of novel retrieval methods. In traditional approaches users or hired evaluators provide manual assessments of relevance. However these approaches are neither efficient nor reliable since they do not scale with the complexity and heterogeneity of available digital information. Automatic approaches, on the other hand, could be efficient but disregard semantic data, which is usually important to assess the actual performance of the evaluated methods. This article proposes to use topic ontologies and semantic similarity data derived from these ontologies to implement an automatic semantic evaluation framework for information retrieval systems. The use of semantic simi- larity data allows to capture the notion of partial relevance, generalizing traditional evaluation metrics, and giving rise to novel performance measures such as semantic precision and semantic harmonic mean. The validity of the approach is supported by user studies and the application of the proposed framework is illustrated with the evaluation of topical retrieval systems. The evaluated systems include a baseline, a supervised version of the Bo1 query refinement method and two multi-objective evolutionary algorithms for context-based retrieval. Finally, we discuss the advantages of ap- plying evaluation metrics that account for semantic similarity data and partial relevance over existing metrics based on the notion of total relevance.

Download Full-text

Consensus-based evaluation framework for distributed information retrieval systems

Knowledge and Information Systems ◽

10.1007/s10115-008-0153-3 ◽

2008 ◽

Vol 18 (2) ◽

pp. 199-211 ◽

Cited By ~ 14

Author(s):

Jason J. Jung

Keyword(s):

Information Retrieval ◽

Evaluation Framework ◽

Distributed Information ◽

Distributed Information Retrieval ◽

Retrieval Systems ◽

Information Retrieval Systems

Download Full-text

Consensus-Based Evaluation Framework for Cooperative Information Retrieval Systems

Agent and Multi-Agent Systems: Technologies and Applications - Lecture Notes in Computer Science ◽

10.1007/978-3-540-72830-6_18 ◽

2007 ◽

pp. 169-178 ◽

Cited By ~ 2

Author(s):

Jason J. Jung ◽

Geun-Sik Jo

Keyword(s):

Information Retrieval ◽

Evaluation Framework ◽

Retrieval Systems ◽

Information Retrieval Systems

Download Full-text

Relevance Predictability in Information Retrieval Systems

Methods of Information in Medicine ◽

10.1055/s-0038-1636254 ◽

1967 ◽

Vol 06 (02) ◽

pp. 45-51 ◽

Cited By ~ 6

Author(s):

A. Kent ◽

J. Belzer ◽

M. Kuhfeerst ◽

E. D. Dym ◽

D. L. Shirey ◽

...

Keyword(s):

Information Retrieval ◽

Experimental Conditions ◽

Treatment Groups ◽

Retrieval Systems ◽

Significant Difference ◽

Information Retrieval Systems ◽

High Predictability ◽

Intermediate Response ◽

Quantitative Indicators ◽

Level Of Processing

An experiment is described which attempts to derive quantitative indicators regarding the potential relevance predictability of the intermediate stimuli used to represent documents in information retrieval systems. In effect, since the decision to peruse an entire document is often predicated upon the examination of one »level of processing« of the document (e.g., the citation and/or abstract), it became interesting to analyze the properties of what constitutes »relevance«. However, prior to such an analysis, an even more elementary step had to be made, namely, to determine what portions of a document should be examined.An evaluation of the ability of intermediate response products (IRPs), functioning as cues to the information content of full documents, to predict the relevance determination that would be subsequently made on these documents by motivated users of information retrieval systems, was made under controlled experimental conditions. The hypothesis that there might be other intermediate response products (selected extracts from the document, i.e., first paragraph, last paragraph, and the combination of first and last paragraph), that would be as representative of the full document as the traditional IRPs (citation and abstract) was tested systematically. The results showed that:1. there is no significant difference among the several IRP treatment groups on the number of cue evaluations of relevancy which match the subsequent user relevancy decision on the document;2. first and last paragraph combinations have consistently predicted relevancy to a higher degree than the other IRPs;3. abstracts were undistinguished as predictors; and4. the apparent high predictability rating for citations was not substantive.Some of these results are quite different than would be expected from previous work with unmotivated subjects.

Download Full-text

Methods for Evaluating Interactive Information Retrieval Systems with Users

10.1561/9781601982254 ◽

2007 ◽

Author(s):

Diane Kelly

Keyword(s):

Information Retrieval ◽

Interactive Information Retrieval ◽

Retrieval Systems ◽

Information Retrieval Systems

Download Full-text

Performance of eight information retrieval systems in the Spanish web

El profesional de la información ◽

10.3145/epi.2005.sep.03 ◽

2005 ◽

Vol 14 (5) ◽

pp. 335-346

Author(s):

Por Carlos Benito Amat ◽

Por Carlos Benito Amat

Keyword(s):

Information Retrieval ◽

Retrieval Systems ◽

Information Retrieval Systems

Download Full-text

A Dialectical Approach to Search Engine Evaluation

Libri ◽

10.1515/libri-2019-0142 ◽

2020 ◽

Vol 70 (3) ◽

pp. 227-237

Author(s):

Mahdi Zeynali-Tazehkandi ◽

Mohsen Nowkarizi

Keyword(s):

Information Retrieval ◽

Information Science ◽

Library And Information Science ◽

Related Literature ◽

Philosophical Foundations ◽

Dialectical Approach ◽

Retrieval Systems ◽

The World ◽

Information Retrieval Systems ◽

Oriented Approach

AbstractEvaluation of information retrieval systems is a fundamental topic in Library and Information Science. The aim of this paper is to connect the system-oriented and the user-oriented approaches to relevant philosophical schools. By reviewing the related literature, it was found that the evaluation of information retrieval systems is successful if it benefits from both system-oriented and user-oriented approaches (composite). The system-oriented approach is rooted in Parmenides’ philosophy of stability (immovable) which Plato accepts and attributes to the world of forms; the user-oriented approach is rooted in Heraclitus’ flux philosophy (motion) which Plato defers and attributes to the tangible world. Thus, using Plato’s theory is a comprehensive approach for recognizing the concept of relevance. The theoretical and philosophical foundations determine the type of research methods and techniques. Therefore, Plato’s dialectical method is an appropriate composite method for evaluating information retrieval systems.

Download Full-text