Information Retrieval by Modified Term Weighting Method Using Random Walk Model with Query Term Position Ranking

Author(s):  
Abu Shamim Mohammad Arif ◽  
Md Masudur Rahman ◽  
Shamima Yeasmin Mukta
2007 ◽  
Vol 01 (04) ◽  
pp. 421-439 ◽  
Author(s):  
SAMER HASSAN ◽  
RADA MIHALCEA ◽  
CARMEN BANEA

This paper describes a new approach for estimating term weights in a document, and shows how the new weighting scheme can be used to improve the accuracy of a text classifier. The method uses term co-occurrence as a measure of dependency between word features. A random walk model is applied on a graph encoding words and co-occurrence dependencies, resulting in scores that represent a quantification of how a particular word feature contributes to a given context. Experiments performed on three standard classification datasets show that the new random walk based approach outperforms the traditional term frequency approach of feature weighting.


Author(s):  
M. Ali Fauzi ◽  
Agus Zainal Arifin ◽  
Anny Yuniarti

One of the most common issue in information retrieval is documents ranking. Documents ranking system collects search terms from the user and orderly retrieves documents based on the relevance. Vector space models based on TF.IDF term weighting is the most common method for this topic. In this study, we are concerned with the study of automatic retrieval of Islamic <em>Fiqh</em> (Law) book collection. This collection contains many books, each of which has tens to hundreds of pages. Each page of the book is treated as a document that will be ranked based on the user query. We developed class-based indexing method called inverse class frequency (ICF) and book-based indexing method inverse book frequency (IBF) for this Arabic information retrieval. Those method then been incorporated with the previous method so that it becomes TF.IDF.ICF.IBF. The term weighting method also used for feature selection due to high dimensionality of the feature space. This novel method was tested using a dataset from 13 Arabic Fiqh e-books. The experimental results showed that the proposed method have the highest precision, recall, and F-Measure than the other three methods at variations of feature selection. The best performance of this method was obtained when using best 1000 features by precision value of 76%, recall value of 74%, and F-Measure value of 75%.


2013 ◽  
Vol 17 (2) ◽  
pp. 153-176 ◽  
Author(s):  
İlker Kocabaş ◽  
Bekir Taner Dinçer ◽  
Bahar Karaoğlan

2010 ◽  
Vol 33 (8) ◽  
pp. 1418-1426 ◽  
Author(s):  
Wei ZHENG ◽  
Chao-Kun WANG ◽  
Zhang LIU ◽  
Jian-Min WANG

Sign in / Sign up

Export Citation Format

Share Document