Evaluation of n-gram conflation approaches for Arabic text retrieval

2009 ◽  
Vol 60 (7) ◽  
pp. 1448-1465 ◽  
Author(s):  
Farag Ahmed ◽  
Andreas Nürnberger
Keyword(s):  
Author(s):  
Turdi Tohti ◽  
Lirui Xu ◽  
Jimmy Huang ◽  
Winira Musajan ◽  
Askar Hamdulla
Keyword(s):  

Author(s):  
Jaffar Atwan ◽  
Masnizah Mohd ◽  
Ghassan Kanaan ◽  
Qusay Bsoul
Keyword(s):  

Author(s):  
Abolfazl Aleahmad ◽  
Parsia Hakimian ◽  
Farzad Mahdikhani ◽  
Farhad Oroumchian

Author(s):  
Waseem Alromima ◽  
Ibrahim F. ◽  
Rania Elgohary ◽  
Mostafa Aref

2014 ◽  
Vol 12 (8) ◽  
pp. 3758-3767 ◽  
Author(s):  
Mostafa Ezzat ◽  
Tarek Ahmed ElGhazaly ◽  
Mervat Gheith

This paper provides a new model aimed to enhanceArabic OCR degraded text retrieval effectiveness. The proposed model based onsimulating the Arabic OCR recognition mistakesbased on both, word based and Character N-Gram approaches. Then we expand the user search query using the expected OCR errors. The resulting search query expanded gives high precision and recall values in searching Arabic OCR-Degraded text rather than the original query. The proposed model showed a significant increase in the degraded text retrieval effectiveness over the previous models. The retrieval effectiveness of the newmodel is %93, while the best effectiveness published for word based approach was %84 and the best effectiveness for character based approach was %56.


Sign in / Sign up

Export Citation Format

Share Document