scholarly journals A semantic approach to post-retrieval query performance prediction

2022 ◽  
Vol 59 (1) ◽  
pp. 102746
Author(s):  
Parastoo Jafarzadeh ◽  
Faezeh Ensan
Author(s):  
Keyvan Sasani ◽  
Mohammad Hossein Namaki ◽  
Yinghui Wu ◽  
Assefaw H. Gebremedhin

2017 ◽  
Vol 53 (6) ◽  
pp. 1320-1341 ◽  
Author(s):  
Maram Hasanain ◽  
Tamer Elsayed

2021 ◽  
Author(s):  
Arabzadehghahyazi Negar

file:///C:/Users/MWF/Downloads/Arabzadehghahyazi, Negar.Pre-retrieval Query Performance Prediction (QPP) methods are oblivious to the performance of the retrieval model as they predict query difficulty prior to observing the set of documents retrieved for the query. Among pre-retrieval query performance predictors, specificity-based metrics investigate how corpus, query and corpus-query level statistics can be used to predict the performance of the query. In this thesis, we explore how neural embeddings can be utilized to define corpus-independent and semantics-aware specificity metrics. Our metrics are based on the intuition that a term that is closely surrounded by other terms in the embedding space is more likely to be specific while a term surrounded by less closely related terms is more likely to be generic. On this basis, we leverage geometric properties between embedded terms to define four groups of metrics: (1) neighborhood-based, (2) graph-based, (3) cluster-based and (4) vector-based metrics. Moreover, we employ learning-to-rank techniques to analyze the importance of individual specificity metrics. To evaluate the proposed metrics, we have curated and publicly share a test collection of term specificity measurements defined based on Wikipedia category hierarchy and DMOZ taxonomy. We report on our extensive experiments on the effectiveness of our metrics through metric comparison, ablation study and comparison against the state-of-the-art baselines. We have shown that our proposed set of pre-retrieval QPP metrics based on the properties of pre-trained neural embeddings are more effective for performance prediction compared to the state-of-the-art methods. We report our findings based on Robust04, ClueWeb09 and Gov2 corpora and their associated TREC topics.


Author(s):  
Negar Arabzadeh ◽  
Amin Bigdeli ◽  
Morteza Zihayat ◽  
Ebrahim Bagheri

Author(s):  
Guglielmo Faggioli ◽  
Oleg Zendel ◽  
J. Shane Culpepper ◽  
Nicola Ferro ◽  
Falk Scholer

2008 ◽  
Vol 23 (4) ◽  
pp. 590-601 ◽  
Author(s):  
Hao Lang ◽  
Bin Wang ◽  
Gareth Jones ◽  
Jin-Tao Li ◽  
Fan Ding ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document