Proximity-Based Good Turing Discounting and Kernel Functions for Pseudo-Relevance Feedback

During the last few years, it has become abundantly clear that the technological advances in information technology have led to the dramatic proliferation of information on the web and this, in turn, has led to the appearance of new words in the Internet. Due to the difficulty of reaching the meanings of these new terms, which play an essential role in retrieving the desired information, it becomes necessary to give more importance to the sites and topics where these new words appear, or rather, to give value to the words that occur frequently with them. For this purpose, in this paper, the authors propose a new robust correlation measure that assesses the relatedness of words for pseudo-relevance feedback. It is based on the co-occurrence and closeness of terms, and aims to select the appropriate words that best capture the user information need. Extensive experiments have been conducted on the OHSUMED test collection and the results show that the proposed approach achieves a considerable performance improvement over the baseline.

Download Full-text

Proximity-Based Good Turing Discounting and Kernel Functions for Pseudo-Relevance Feedback

Information Retrieval and Management ◽

10.4018/978-1-5225-5191-1.ch100 ◽

2018 ◽

pp. 2244-2266

Author(s):

Ilyes Khennak ◽

Bab Ezzouar

Keyword(s):

Relevance Feedback ◽

Essential Role ◽

Kernel Functions ◽

The Internet ◽

Test Collection ◽

Information Need ◽

New Words ◽

Technological Advances ◽

Pseudo Relevance Feedback ◽

The Web

Download Full-text

Hybrid Query Expansion Model Based on Pseudo Relevance Feedback and Semantic Tree for Arabic IR

International Journal of Information Retrieval Research ◽

10.4018/ijirr.289949 ◽

2022 ◽

Vol 12 (1) ◽

pp. 0-0

Keyword(s):

Relevance Feedback ◽

Query Expansion ◽

Semantic Relations ◽

Information Need ◽

Weighted Tree ◽

Average Precision ◽

Arabic Information Retrieval ◽

Semantic Tree ◽

Expansion Model ◽

Pseudo Relevance Feedback

In this paper, the authors propose and readapt a new concept-based approach of query expansion in the context of Arabic information retrieval. The purpose is to represent the query by a set of weighted concepts in order to identify better the user's information need. Firstly, concepts are extracted from the initially retrieved documents by the Pseudo-Relevance Feedback method, and then they are integrated into a semantic weighted tree in order to detect more information contained in the related concepts connected by semantic relations to the primary concepts. The authors use the “Arabic WordNet” as a resource to extract, disambiguate concepts and build the semantic tree. Experimental results demonstrate that measure of MAP (Mean Average Precision) is about 10% of improvement using the open source Lucene as IR System on a collection formed from the Arabic BBC news.

Download Full-text

Context Window Based Co-occurrence Approach for Improving Feedback Based Query Expansion in Information Retrieval

International Journal of Information Retrieval Research ◽

10.4018/ijirr.2015100103 ◽

2015 ◽

Vol 5 (4) ◽

pp. 31-45 ◽

Cited By ~ 11

Author(s):

Jagendra Singh ◽

Aditi Sharan

Keyword(s):

Information Retrieval ◽

Relevance Feedback ◽

Query Expansion ◽

Contextual Information ◽

Optimal Combination ◽

Query Term ◽

Benchmark Data ◽

First Pass ◽

Baseline Approach ◽

Pseudo Relevance Feedback

Pseudo-relevance feedback (PRF) is a type of relevance feedback approach of query expansion that considers the top ranked retrieved documents as relevance feedback. In this paper the authors focus is to capture the limitation of co-occurrence and PRF based query expansion approach and the authors proposed a hybrid method to improve the performance of PRF based query expansion by combining query term co-occurrence and query terms contextual information based on corpus of top retrieved feedback documents in first pass. Firstly, the paper suggests top retrieved feedback documents based query term co-occurrence approach to select an optimal combination of query terms from a pool of terms obtained using PRF based query expansion. Second, contextual window based approach is used to select the query context related terms from top feedback documents. Third, comparisons were made among baseline, co-occurrence and contextual window based approaches using different performance evaluating metrics. The experiments were performed on benchmark data and the results show significant improvement over baseline approach.

Download Full-text