Ad hoc Retrieval models

Clear and Private Ad Hoc Retrieval Models on Web Data

Advances in Computational Intelligence and Robotics - Advanced Metaheuristic Methods in Big Data Retrieval and Analytics ◽

10.4018/978-1-5225-7338-8.ch009 ◽

2019 ◽

pp. 194-211

Author(s):

Souria Ortiga

Keyword(s):

Ad Hoc ◽

Digital Information ◽

Retrieval Models ◽

Efficient Access ◽

Ad Hoc Retrieval ◽

The Relationship ◽

Source Of Information ◽

Automated Methods ◽

Search Information ◽

Information Today

During the 1980s, and despite its maturity, the search information (RI) was only intended for librarians and experts in the field of information. Such tendentious vision prevailed for many years. Since the mid-90s, the web has become an increasingly crucial source of information , which has a renewed interest in IR. In the last decade, the popularization of computers, the terrible explosion in the amount of unstructured data, internal documents, and corporate collections, and the huge and growing number of internet document sources have deeply shaken the relationship between man and information. Today, a great change has taken place, and the RI is often used by billions of people around the world. Simply, the need for automated methods for efficient access to this huge amount of digital information has become more important, and appears as a necessity.

Download Full-text

A Comparison between Term-Independence Retrieval Models for Ad Hoc Retrieval

ACM Transactions on Information Systems ◽

10.1145/3483612 ◽

2022 ◽

Vol 40 (3) ◽

pp. 1-37

Author(s):

Edward Kai Fung Dang ◽

Robert Wing Pong Luk ◽

James Allan

Keyword(s):

Ad Hoc ◽

Retrieval Model ◽

Ranking Functions ◽

Retrieval Models ◽

Document Ranking ◽

Reproducibility Study ◽

Wide Range ◽

Models Comparison ◽

Ad Hoc Retrieval ◽

Leading Term

In Information Retrieval, numerous retrieval models or document ranking functions have been developed in the quest for better retrieval effectiveness. Apart from some formal retrieval models formulated on a theoretical basis, various recent works have applied heuristic constraints to guide the derivation of document ranking functions. While many recent methods are shown to improve over established and successful models, comparison among these new methods under a common environment is often missing. To address this issue, we perform an extensive and up-to-date comparison of leading term-independence retrieval models implemented in our own retrieval system. Our study focuses on the following questions: (RQ1) Is there a retrieval model that consistently outperforms all other models across multiple collections; (RQ2) What are the important features of an effective document ranking function? Our retrieval experiments performed on several TREC test collections of a wide range of sizes (up to the terabyte-sized Clueweb09 Category B) enable us to answer these research questions. This work also serves as a reproducibility study for leading retrieval models. While our experiments show that no single retrieval model outperforms all others across all tested collections, some recent retrieval models, such as MATF and MVD, consistently perform better than the common baselines.

Download Full-text

Performance Comparison of Ad-Hoc Retrieval Models over Full-Text vs. Titles of Documents

Lecture Notes in Computer Science - Maturity and Innovation in Digital Libraries ◽

10.1007/978-3-030-04257-8_30 ◽

2018 ◽

pp. 290-303

Author(s):

Ahmed Saleh ◽

Tilman Beck ◽

Lukas Galke ◽

Ansgar Scherp

Keyword(s):

Full Text ◽

Ad Hoc ◽

Performance Comparison ◽

Retrieval Models ◽

Ad Hoc Retrieval

Download Full-text

Ad hoc Retrieval models

10.1007/springerreference_64397 ◽

2011 ◽

Keyword(s):

Ad Hoc ◽

Retrieval Models ◽

Ad Hoc Retrieval

Download Full-text

Implicit Entity Linking Through Ad-Hoc Retrieval

2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) ◽

10.1109/asonam.2018.8508612 ◽

2018 ◽

Cited By ~ 2

Author(s):

Hawre Hosseini ◽

Tam T. Nguyen ◽

Ebrahim Bagheri

Keyword(s):

Ad Hoc ◽

Entity Linking ◽

Ad Hoc Retrieval

Download Full-text

Improving English and Chinese ad-hoc retrieval

10.3115/1119089.1119113 ◽

1996 ◽

Author(s):

Kui-Lam Kwok

Keyword(s):

Ad Hoc ◽

Ad Hoc Retrieval

Download Full-text

Ad Hoc Retrieval of Documents with Topical Opinion

Lecture Notes in Computer Science - Advances in Information Retrieval ◽

10.1007/978-3-540-71496-5_37 ◽

2007 ◽

pp. 405-417 ◽

Cited By ~ 5

Author(s):

Jason Skomorowski ◽

Olga Vechtomova

Keyword(s):

Ad Hoc ◽

Ad Hoc Retrieval

Download Full-text

Frequent Case Generation in Ad Hoc Retrieval of Three Indian Languages – Bengali, Gujarati and Marathi

Multilingual Information Access in South Asian Languages - Lecture Notes in Computer Science ◽

10.1007/978-3-642-40087-2_4 ◽

2013 ◽

pp. 38-50 ◽

Cited By ~ 3

Author(s):

Jiaul H. Paik ◽

Kimmo Kettunen ◽

Dipasree Pal ◽

Kalervo Järvelin

Keyword(s):

Ad Hoc ◽

Indian Languages ◽

Ad Hoc Retrieval

Download Full-text

Improving Weak Ad-Hoc Retrieval by Web Assistance and Data Fusion

Information Retrieval Technology - Lecture Notes in Computer Science ◽

10.1007/11562382_2 ◽

2005 ◽

pp. 17-30 ◽

Cited By ~ 3

Author(s):

Kui-Lam Kwok ◽

Laszlo Grunfeld ◽

Peter Deng

Keyword(s):

Data Fusion ◽

Ad Hoc ◽

Ad Hoc Retrieval

Download Full-text

On the statistical optimality of CO<sub>2</sub> atmospheric inversions assimilating CO<sub>2</sub> column retrievals

Atmospheric Chemistry and Physics ◽

10.5194/acp-15-11133-2015 ◽

2015 ◽

Vol 15 (19) ◽

pp. 11133-11145 ◽

Cited By ~ 30

Author(s):

F. Chevallier

Keyword(s):

Bias Correction ◽

Ad Hoc ◽

Robust Statistics ◽

Model Simulation ◽

High Gain ◽

Ad Hoc Retrieval ◽

Air Sample ◽

Retrieval Bias ◽

Atmospheric Inversion ◽

Atmospheric Inversions

Abstract. The extending archive of the Greenhouse Gases Observing Satellite (GOSAT) measurements (now covering about 6 years) allows increasingly robust statistics to be computed, that document the performance of the corresponding retrievals of the column-average dry air-mole fraction of CO2 (XCO2). Here, we demonstrate that atmospheric inversions cannot be rigorously optimal when assimilating current XCO2 retrievals, even with averaging kernels, in particular because retrievals and inversions use different assumption about prior uncertainty. We look for some practical evidence of this sub-optimality from the view point of atmospheric inversion by comparing a model simulation constrained by surface air-sample measurements with one of the GOSAT retrieval products (NASA's ACOS). The retrieval-minus-model differences result from various error sources, both in the retrievals and in the simulation: we discuss the plausibility of the origin of the major patterns. We find systematic retrieval errors over the dark surfaces of high-latitude lands and over African savannahs. More importantly, we also find a systematic over-fit of the GOSAT radiances by the retrievals over land for the high-gain detector mode, which is the usual observation mode. The over-fit is partially compensated by the retrieval bias-correction. These issues are likely common to other retrieval products and may explain some of the surprising and inconsistent CO2 atmospheric inversion results obtained with the existing GOSAT retrieval products. We suggest that reducing the observation weight in the retrieval schemes (for instance so that retrieval increments to the retrieval prior values are halved for the studied retrieval product) would significantly improve the retrieval quality and reduce the need for (or at least reduce the complexity of) ad-hoc retrieval bias correction.

Download Full-text