AQtpUIR: Adaptive query term proximity based user information retrieval

Pseudo-relevance feedback (PRF) is a type of relevance feedback approach of query expansion that considers the top ranked retrieved documents as relevance feedback. In this paper the authors focus is to capture the limitation of co-occurrence and PRF based query expansion approach and the authors proposed a hybrid method to improve the performance of PRF based query expansion by combining query term co-occurrence and query terms contextual information based on corpus of top retrieved feedback documents in first pass. Firstly, the paper suggests top retrieved feedback documents based query term co-occurrence approach to select an optimal combination of query terms from a pool of terms obtained using PRF based query expansion. Second, contextual window based approach is used to select the query context related terms from top feedback documents. Third, comparisons were made among baseline, co-occurrence and contextual window based approaches using different performance evaluating metrics. The experiments were performed on benchmark data and the results show significant improvement over baseline approach.

Download Full-text

Information Retrieval by Modified Term Weighting Method Using Random Walk Model with Query Term Position Ranking

2009 International Conference on Signal Processing Systems ◽

10.1109/icsps.2009.122 ◽

2009 ◽

Cited By ~ 3

Author(s):

Abu Shamim Mohammad Arif ◽

Md Masudur Rahman ◽

Shamima Yeasmin Mukta

Keyword(s):

Information Retrieval ◽

Random Walk ◽

Random Walk Model ◽

Query Term ◽

Term Weighting ◽

Weighting Method

Download Full-text

Enhanced Spoken Sentence Retrieval Using a Conventional Automatic Speech Recognizer in Smart Home

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213016500172 ◽

2016 ◽

Vol 25 (03) ◽

pp. 1650017 ◽

Cited By ~ 1

Author(s):

Hyeokju Ahn ◽

Harksoo Kim

Keyword(s):

Information Retrieval ◽

Smart Home ◽

Retrieval System ◽

Generation Model ◽

Query Term ◽

Post Processing ◽

Retrieval Systems ◽

Sentence Retrieval ◽

Information Retrieval Systems ◽

Performance Results

With the rapid evolution of smart home environment, the demand for spoken information retrieval (e.g., voice-activated FAQ retrieval) on information appliances is increasing. In spoken information retrieval, users’ spoken queries are converted into text queries using automatic speech recognition (ASR) engines. If top-1 results of the ASR engines are incorrect, the errors are propagated to information retrieval systems. If a document collection is a small set of sentences such as frequently asked questions (FAQs), the errors have additional effect on the performance of information retrieval systems. To improve the performance of such a sentence retrieval system, we propose a post-processing model of an ASR engine. The post-processing model consists of a re-ranking and a query term generation model. The re-ranking model rearranges top-n outputs of the ASR engines using the ranking support vector machine (Ranking SVM). The query term generation model extracts meaningful content words from the re-ranked queries based on term frequencies and query rankings. In the experiments, the re-ranking model improved the top-1 performance results of an underlying ASR engine with 4.4% higher precision and 6.4% higher recall rate. The query term generation model improved the performance results of an underlying information retrieval system with an accuracy 2.4% to 2.6% higher. Based on the experimental result, the proposed model revealed that it could improve the performance of a spoken sentence retrieval system in a restricted domain.

Download Full-text

Query term disambiguation for Web cross-language information retrieval using a search engine

Proceedings of the fifth international workshop on on Information retrieval with Asian languages - IRAL '00 ◽

10.1145/355214.355218 ◽

2000 ◽

Cited By ~ 20

Author(s):

Akira Maeda ◽

Fatiha Sadat ◽

Masatoshi Yoshikawa ◽

Shunsuke Uemura

Keyword(s):

Information Retrieval ◽

Search Engine ◽

Query Term ◽

Cross Language Information Retrieval ◽

Cross Language

Download Full-text

Analysis of Vector Space Method in Information Retrieval for Smart Answering System

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2020.9099 ◽

2020 ◽

Vol 17 (9) ◽

pp. 4468-4472

Author(s):

Deepa Yogish ◽

T. N. Manjunath ◽

Ravindra S. Hegadi

Keyword(s):

Information Retrieval ◽

Vector Space ◽

Vector Space Model ◽

Query Term ◽

Frequency Method ◽

Document Ranking ◽

User Intent ◽

Space Model ◽

Relevant Document ◽

User Query

In the world of internet, searching play a vital role to retrieve the relevant answers for the user specific queries. The most promising application of natural language processing and information retrieval system is Question answering system which provides directly the accurate answer instead of set of documents. The main objective of information retrieval is to retrieve relevant document from a huge volume of data sets underlying in the internet using appropriatemodel. There are many models proposed for retrieval process such as Boolean, Vector space and Probabilistic method. Vector space model is best method in information retrieval for document ranking with efficient document representation which combines simplicity and clarity. VSM adopts similarity function to measure the matching between documents and user intent, and assign scores from the biggest to smallest. The documents and query are assigned with weights using term frequency and inverse document frequency method. To retrieve most relevant document to the user query term, document ranking function cosine similarity score is applied for every document and user query. The documents having more similarity scores will be considered as relevant documents to the query term and they are ranked based on these scores. This paper emphasizes on different techniques of information retrieval and Vector Space Model offers a realistic compromise in IR processing. It allows best weighing scheme which ranks the set of documents in order of relevance based on user query.

Download Full-text

Query term weights as constraints in fuzzy information retrieval

Information Processing & Management ◽

10.1016/0306-4573(91)90028-k ◽

1991 ◽

Vol 27 (1) ◽

pp. 15-26 ◽

Cited By ~ 48

Author(s):

G. Bordogna ◽

P. Carrara ◽

G. Pasi

Keyword(s):

Information Retrieval ◽

Query Term ◽

Fuzzy Information ◽

Term Weights ◽

Fuzzy Information Retrieval

Download Full-text

Context Window Based Co-Occurrence Approach for Improving Feedback Based Query Expansion in Information Retrieval

Information Retrieval and Management ◽

10.4018/978-1-5225-5191-1.ch072 ◽

2018 ◽

pp. 1597-1613

Author(s):

Jagendra Singh ◽

Aditi Sharan

Keyword(s):

Information Retrieval ◽

Hybrid Method ◽

Relevance Feedback ◽

Query Expansion ◽

Contextual Information ◽

Optimal Combination ◽

Query Term ◽

Benchmark Data ◽

Baseline Approach ◽

Pseudo Relevance Feedback

Pseudo-relevance feedback (PRF) is a type of relevance feedback approach of query expansion that considers the top ranked retrieved documents as relevance feedback. In this paper the authors focus is to capture the limitation of co-occurrence and PRF based query expansion approach and the authors proposed a hybrid method to improve the performance of PRF based query expansion by combining query term co-occurrence and query terms contextual information based on corpus of top retrieved feedback documents in first pass. Firstly, the paper suggests top retrieved feedback documents based query term co-occurrence approach to select an optimal combination of query terms from a pool of terms obtained using PRF based query expansion. Second, contextual window based approach is used to select the query context related terms from top feedback documents. Third, comparisons were made among baseline, co-occurrence and contextual window based approaches using different performance evaluating metrics. The experiments were performed on benchmark data and the results show significant improvement over baseline approach.

Download Full-text

A combined statistical query term disambiguation in cross-language information retrieval

Proceedings. 13th International Workshop on Database and Expert Systems Applications ◽

10.1109/dexa.2002.1045907 ◽

2004 ◽

Cited By ~ 4

Author(s):

F. Sadat ◽

A. Maeda ◽

M. Yoshikawa ◽

S. Uemura

Keyword(s):

Information Retrieval ◽

Query Term ◽

Cross Language Information Retrieval ◽

Cross Language

Download Full-text

A study on query terms proximity embedding for information retrieval

International Journal of Distributed Sensor Networks ◽

10.1177/1550147717694891 ◽

2017 ◽

Vol 13 (2) ◽

pp. 155014771769489 ◽

Cited By ~ 7

Author(s):

Ya-nan Qiao ◽

Qinghe Du ◽

Di-fang Wan

Keyword(s):

Wireless Networks ◽

Information Retrieval ◽

Cyber Physical Systems ◽

Query Term ◽

Retrieval Models ◽

Physical Systems ◽

Retrieval Systems ◽

Information Retrieval Systems ◽

Original Information ◽

Term Field

Information retrieval is applied widely to models and algorithms in wireless networks for cyber-physical systems. Query terms proximity has proved that it is a very useful information to improve the performance of information retrieval systems. Query terms proximity cannot retrieve documents independently, and it must be incorporated into original information retrieval models. This article proposes the concept of query term proximity embedding, which is a new method to incorporate query term proximity into original information retrieval models. Moreover, term-field-convolutions frequency framework, which is an implementation of query term proximity embedding, is proposed in this article, and experimental results show that this framework can improve the performance effectively compared with traditional proximity retrieval models.

Download Full-text

Similarity Web Pages Retrieval Technologies on the Internet

Encyclopedia of Information Science and Technology, First Edition ◽

10.4018/978-1-59140-553-5.ch440 ◽

2005 ◽

pp. 2486-2491

Author(s):

Rung Ching Chen ◽

Ming Yung Tsai ◽

Chung Hsun Hsieh

Keyword(s):

Information Retrieval ◽

Search Engine ◽

Search Engines ◽

Fast Growth ◽

The Other ◽

The Internet ◽

Web Pages ◽

Query Term ◽

Web Page ◽

Critical Problems

In recent years, due to the fast growth of the Internet, the services and information it provides are constantly expanding. Madria and Bhowmick (1999) and Baeza-Yates (2003) indicated that most large search engines need to comply to, on average, at least millions of hits daily in order to satisfy the users’ needs for information. Each search engine has its own sorting policy and the keyword format for the query term, but there are some critical problems. The searches may get more or less information. In the former, the user always gets buried in the information. Requiring only a little information, they always select some former items from the large amount of returned information. In the latter, the user always re-queries using another searching keyword to do searching work. The re-query operation also leads to retrieving information in a great amount, which leads to having a large amount of useless information. That is a bad cycle of information retrieval. The similarity Web page retrieval can help avoid browsing the useless information. The similarity Web page retrieval indicates a Web page, and then compares the page with the other Web pages from the searching results of search engines. The similarity Web page retrieval will allow users to save time by not browsing unrelated Web pages and reject non-similar Web pages, rank the similarity order of Web pages and cluster the similarity Web pages into the same classification.

Download Full-text