Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) Model

Existing probabilistic retrieval models do not restrict the domain of the random variables that they deal with. In this article, we show that the upper bound of the normalized term frequency ( tf ) from the relevant documents is much smaller than the upper bound of the normalized tf from the whole collection. As a result, the existing models suffer from two major problems: (i) the domain mismatch causes data modeling error, (ii) since the outliers have very large magnitude and the retrieval models follow tf hypothesis, the combination of these two factors tends to overestimate the relevance score. In an attempt to address these problems, we propose novel weighted probabilistic models based on truncated distributions. We evaluate our models on a set of large document collections. Significant performance improvement over six existing probabilistic models is demonstrated.

Download Full-text

Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) Model

10.1007/springerreference_63938 ◽

2011 ◽

Keyword(s):

Retrieval Models ◽

Probabilistic Retrieval

Download Full-text

On event space and rank equivalence between probabilistic retrieval models

Information Retrieval ◽

10.1007/s10791-008-9062-z ◽

2008 ◽

Vol 11 (6) ◽

pp. 539-561 ◽

Cited By ~ 10

Author(s):

Robert W. P. Luk

Keyword(s):

Event Space ◽

Retrieval Models ◽

Probabilistic Retrieval

Download Full-text

Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) Model

Encyclopedia of Database Systems ◽

10.1007/978-1-4899-7993-3_919-2 ◽

2016 ◽

pp. 1-7

Author(s):

Thomas Roelleke ◽

Jun Wang ◽

Stephen Robertson

Keyword(s):

Retrieval Models ◽

Probabilistic Retrieval

Download Full-text

A Survey on Information Retrieval Models, Techniques and Applications

International Journal of Advanced Research in Computer Science and Software Engineering ◽

10.23956/ijarcsse.v7i7.90 ◽

2017 ◽

Vol 7 (7) ◽

pp. 16 ◽

Cited By ~ 1

Author(s):

Ndengabaganizi Tonny James ◽

Rajkumar Kannan

Keyword(s):

Information Retrieval ◽

Retrieval Models ◽

Knowledge Based ◽

Long Time

It has been long time many people have realized the importance of archiving and finding information. With the advent of computers, it became possible to store large amounts of information; and finding useful information from such collections became a necessity. Over the last forty years, Information Retrieval (IR) has matured considerably. Several IR systems are used on an everyday basis by a wide variety of users. Information retrieval (IR) is generally concerned with the searching and retrieving of knowledge-based information from database. In this paper, we will discuss about the various models and techniques and for information retrieval. We are also providing the overview of traditional IR models.

Download Full-text

Probabilistic retrieval: thresholding for automatic filtering

10.1049/ic:19990891 ◽

1999 ◽

Author(s):

S. Robertson

Keyword(s):

Probabilistic Retrieval ◽

Automatic Filtering

Download Full-text