Placing Query Term Proximity in Search Context

Pseudo-relevance feedback (PRF) is a type of relevance feedback approach of query expansion that considers the top ranked retrieved documents as relevance feedback. In this paper the authors focus is to capture the limitation of co-occurrence and PRF based query expansion approach and the authors proposed a hybrid method to improve the performance of PRF based query expansion by combining query term co-occurrence and query terms contextual information based on corpus of top retrieved feedback documents in first pass. Firstly, the paper suggests top retrieved feedback documents based query term co-occurrence approach to select an optimal combination of query terms from a pool of terms obtained using PRF based query expansion. Second, contextual window based approach is used to select the query context related terms from top feedback documents. Third, comparisons were made among baseline, co-occurrence and contextual window based approaches using different performance evaluating metrics. The experiments were performed on benchmark data and the results show significant improvement over baseline approach.

Download Full-text

Incorporating query term dependencies in language models for document retrieval

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval - SIGIR '03 ◽

10.1145/860435.860523 ◽

2003 ◽

Cited By ~ 4

Author(s):

Munirathnam Srikanth ◽

Rohini Srihari

Keyword(s):

Document Retrieval ◽

Language Models ◽

Query Term

Download Full-text

A Novel Method for Analyzing Best Pages Generated by Query Term Synonym Combination

10.1007/978-981-16-5120-5_33 ◽

2021 ◽

pp. 441-455

Author(s):

L. M. R. J. Lobo ◽

Kale Manoj Birbal

Keyword(s):

Query Term ◽

Novel Method

Download Full-text

Named Entity Based Ranking with Term Proximity for XML Retrieval

International Journal of Information Retrieval Research ◽

10.4018/ijirr.2018040104 ◽

2018 ◽

Vol 8 (2) ◽

pp. 57-77 ◽

Cited By ~ 1

Author(s):

Abubakar Roko ◽

Shyamala Doraisamy ◽

Azreen Azman ◽

Azrul Hazri Jantan

Keyword(s):

Keyword Search ◽

Query Term ◽

Search System ◽

Keyword Query ◽

Xml Retrieval ◽

Named Entity ◽

Data Content ◽

Search Quality ◽

The Impact ◽

Confidence Value

In this article, an indexing scheme that includes the named entity category for each indexed term is proposed. Based on this, two methods are proposed, one to infer the semantics of an XML element based on its data content, called the confidence value of the element, and the second method computes the proximity scores of the query terms. The confidence value of an element is obtained based on the probability of a named entity category in the data content of the underlying XML element. The proximity score of the query terms measures the proximity and ordering of the query term within an XML element. The article then shows how a ranking function uses the confidence value of an XML element and proximity score to mitigate the impact of higher frequency terms and compute the relevance between a keyword query and an XML fragment. Finally, a keyword search system is introduced and experiments show that the proposed system outperforms existing approaches in terms of search quality and achieve a higher efficiency.

Download Full-text

An investigation of the relative influences of database informativeness, query size and query term specificity on the effectiveness of Medline searching

Journal of Information Science ◽

10.1177/016555159502100303 ◽

1995 ◽

Vol 21 (3) ◽

pp. 173-185 ◽

Cited By ~ 2

Author(s):

M.H. Heine

Keyword(s):

Query Term

Download Full-text

Hybrid Pre-Query Term Expansion using Latent Semantic Analysis

Fourth IEEE International Conference on Data Mining (ICDM'04) ◽

10.1109/icdm.2004.10085 ◽

2005 ◽

Cited By ~ 6

Author(s):

L.A.F. Park ◽

K. Ramamohanarao

Keyword(s):

Latent Semantic Analysis ◽

Semantic Analysis ◽

Query Term

Download Full-text

A Study of Query Term Deletion Using Large-Scale E-commerce Search Logs

Lecture Notes in Computer Science - Advances in Information Retrieval ◽

10.1007/978-3-319-06028-6_20 ◽

2014 ◽

pp. 235-246 ◽

Cited By ~ 1

Author(s):

Bishan Yang ◽

Nish Parikh ◽

Gyanit Singh ◽

Neel Sundaresan

Keyword(s):

Large Scale ◽

Query Term ◽

Search Logs

Download Full-text

An adaptive term proximity based rocchio’s model for clinical decision support retrieval

BMC Medical Informatics and Decision Making ◽

10.1186/s12911-019-0986-6 ◽

2019 ◽

Vol 19 (S9) ◽

Cited By ~ 3

Author(s):

Min Pan ◽

Yue Zhang ◽

Qiang Zhu ◽

Bo Sun ◽

Tingting He ◽

...

Keyword(s):

Query Expansion ◽

Window Size ◽

Clinical Decision ◽

Recall Rate ◽

Biomedical Literature ◽

Query Term ◽

Retrieval Models ◽

Candidate Term ◽

Adaptive Parameter ◽

Clinical Support

Abstract Background In order to better help doctors make decision in the clinical setting, research is necessary to connect electronic health record (EHR) with the biomedical literature. Pseudo Relevance Feedback (PRF) is a kind of classical query modification technique that has shown to be effective in many retrieval models and thus suitable for handling terse language and clinical jargons in EHR. Previous work has introduced a set of constraints (axioms) of traditional PRF model. However, in the feedback document, the importance degree of candidate term and the co-occurrence relationship between a candidate term and a query term. Most methods do not consider both of these factors. Intuitively, terms that have higher co-occurrence degree with a query term are more likely to be related to the query topic. Methods In this paper, we incorporate original HAL model into the Rocchio’s model, and propose a new concept of term proximity feedback weight. A HAL-based Rocchio’s model in the query expansion, called HRoc, is proposed. Meanwhile, we design three normalization methods to better incorporate proximity information to query expansion. Finally, we introduce an adaptive parameter to replace the length of sliding window of HAL model, and it can select window size according to document length. Results Based on 2016 TREC Clinical Support medicine dataset, experimental results demonstrate that the proposed HRoc and HRoc_AP models superior to other advanced models, such as PRoc2 and TF-PRF methods on various evaluation metrics. Among them, compared with the Proc2 and TF-PRF models, the MAP of our model is increased by 8.5% and 12.24% respectively, while the F1 score of our model is increased by 7.86% and 9.88% respectively. Conclusions The proposed HRoc model can effectively enhance the precision and the recall rate of Information Retrieval and gets a more precise result than other models. Furthermore, after introducing self-adaptive parameter, the advanced HRoc_AP model uses less hyper-parameters than other models while enjoys an equivalent performance, which greatly improves the efficiency and applicability of the model and thus helps clinicians to retrieve clinical support document effectively.

Download Full-text