scholarly journals A two-stage information retrieval system based on interactive multimodal genetic algorithm for query weight optimization

Author(s):  
Hao Cong ◽  
Wei-Neng Chen ◽  
Wei-Jie Yu

AbstractQuery weight optimization, which aims to find an optimal combination of the weights of query terms for sorting relevant documents, is an important topic in the information retrieval system. Due to the huge search space, the query optimization problem is intractable, and evolutionary algorithms have become one popular approach. But as the size of the database grows, traditional retrieval approaches may return a lot of results, which leads to low efficiency and poor practicality. To solve this problem, this paper proposes a two-stage information retrieval system based on an interactive multimodal genetic algorithm (IMGA) for a query weight optimization system. The proposed IMGA has two stages: quantity control and quality optimization. In the quantity control stage, a multimodal genetic algorithm with the aid of the niching method selects multiple promising combinations of query terms simultaneously by which the numbers of retrieved documents are controlled in an appropriate range. In the quality optimization stage, an interactive genetic algorithm is designed to find the optimal query weights so that the most user-friendly document retrieval sequence can be yielded. Users’ feedback information will accelerate the optimization process, and a genetic algorithm (GA) performs interactively with the action of relevance feedback mechanism. Replacing user evaluation, a mathematical model is built to evaluate the fitness values of individuals. In the proposed two-stage method, not only the number of returned results can be controlled, but also the quality and accuracy of retrieval can be improved. The proposed method is run on the database which with more than 2000 documents. The experimental results show that our proposed method outperforms several state-of-the-art query weight optimization approaches in terms of the precision rate and the recall rate.

Author(s):  
KENJI SAITO ◽  
HIROYUKI SHIOYA ◽  
TSUTOMU DA-TE

We improve a document retrieval method based on the so-called maximum entropy principle proposed by Cooper, and show how to implement this system on a Bayesian network. A Bayesian network is a probabilistic model for expressing probabilistic relations among random variables. We show advantages of a document retrieval system on a Bayesian network in comparison with Cooper's system. The original document retrieval system based on the maximum entropy principle has a drawback: a result of retrieval can not be obtained in some cases. In this paper, we resolve this drawback by fuzzification of user retrieval requests.


Author(s):  
Roberto Willrich ◽  
Rafael de Moura Speroni ◽  
Christopher Viana Lima ◽  
André Luiz de Oliveira Diaz ◽  
Sérgio Murilo Penedo

Author(s):  
Hanene Maghrebi ◽  
Amos David

Managing the increasing growth of multimedia content still poses some problems. The challenge is to propose relevant information to the users among the large volume of information available. The main idea that drives our approach is to provide an open information retrieval system, which can adapt its results to several…La gestion de l’information multimédia soulève encore quelques problèmes. Le défi est de pouvoir proposer à l’utilisateur des informations pertinentes parmi la quantité d’information qui ne cesse de s’accroître. Dans cette lignée, nous proposons un système ouvert de recherche d’information capable d’adapter ses résultats aux différents… 


Sign in / Sign up

Export Citation Format

Share Document