Improving OCR-Degraded Arabic Text Retrieval Through an Enhanced Orthographic Query Expansion Model

Author(s):  
Tarek Elghazaly
Author(s):  
Waseem Alromima ◽  
Ibrahim F. ◽  
Rania Elgohary ◽  
Mostafa Aref

2018 ◽  
Vol 21 (4) ◽  
pp. 337-367 ◽  
Author(s):  
Meriem Amina Zingla ◽  
Chiraz Latiri ◽  
Philippe Mulhem ◽  
Catherine Berrut ◽  
Yahya Slimani

2014 ◽  
Vol 977 ◽  
pp. 464-467
Author(s):  
Li Xin Gan ◽  
Wei Tu

Query expansion is one of the key technologies for improving precision and recall in information retrieval. In order to overcome limitations of single corpus, in this paper, semantic characteristics of Wikipedia corpus is combined with the standard corpus to extract more rich relationship between terms for construction of a steady Markov semantic network. Information of the entity pages and disambiguation pages in Wikipedia is comprehensively utilized to classify query terms to improve query classification accuracy. Related candidates with high quality can be used for query expansion according to semantic pruning. The proposal in our work is benefit to improve retrieval performance and to save search computational cost.


1998 ◽  
Vol 4 (1) ◽  
pp. 41-55 ◽  
Author(s):  
STEFAN LANGER ◽  
MARIANNE HICKEY

In this paper, we present results of a project that investigated the application of lexicon based text retrieval techniques to Alternative and Augmentative Communication (AAC). As a practical outcome of this research, a communication aid based on message retrieval by key words was designed, implemented and evaluated. The message retrieval module in the system uses a large semantic lexicon, derived from the WordNet database, for query expansion. Trials have been carried out with the device to evaluate whether the approach is suitable for AAC, and to determine the semantic relations that lead to efficient message retrieval. The first part of this paper describes the background of the project and highlights the retrieval requirements for a communication aid, which differ considerably from the requirements in standard text retrieval. We then present the overall design of the WordKeys communication aid and describe the tasks of its sub-modules. We summarise trials that have been carried out to determine the effect of semantic query expansion on the success of message retrieval. Evaluation results show that information about word frequency can solve problems that occurred in the semantic query expansion because of taxonomies that have too many intermediate steps between closely related words. Finally, a user evaluation with the improved system showed that full text retrieval is an effective approach to message access in a communication aid.


Sign in / Sign up

Export Citation Format

Share Document