scholarly journals Speeding-Up Graph-Based Keyword Spotting in Historical Handwritten Documents

Author(s):  
Michael Stauffer ◽  
Andreas Fischer ◽  
Kaspar Riesen
2020 ◽  
Vol 134 ◽  
pp. 125-134 ◽  
Author(s):  
Michael Stauffer ◽  
Andreas Fischer ◽  
Kaspar Riesen

2018 ◽  
Vol 29 (1) ◽  
pp. 719-735 ◽  
Author(s):  
Samir Malakar ◽  
Manosij Ghosh ◽  
Ram Sarkar ◽  
Mita Nasipuri

Abstract Word searching or keyword spotting is an important research problem in the domain of document image processing. The solution to the said problem for handwritten documents is more challenging than for printed ones. In this work, a two-stage word searching schema is introduced. In the first stage, all the irrelevant words with respect to a search word are filtered out from the document page image. This is carried out using a zonal feature vector, called pre-selection feature vector, along with a rule-based binary classification method. In the next step, a holistic word recognition paradigm is used to confirm a pre-selected word as search word. To accomplish this, a modified histogram of oriented gradients-based feature descriptor is combined with a topological feature vector. This method is experimented on a QUWI English database, which is freely available through the International Conference on Document Analysis and Recognition 2015 competition entitled “Writer Identification and Gender Classification.” This technique not only provides good retrieval performance in terms of recall, precision, and F-measure scores, but it also outperforms some state-of-the-art methods.


2013 ◽  
Author(s):  
Mehdi Haji ◽  
Mohammad R. Ameri ◽  
Tien D. Bui ◽  
Ching Y. Suen ◽  
Dominique Ponson

Author(s):  
HENG ZHANG ◽  
DA-HAN WANG ◽  
CHENG-LIN LIU ◽  
HORST BUNKE

In this paper, we propose a method for text-query-based keyword spotting from online Chinese handwritten documents using character classification model. The similarity between the query word and handwriting is obtained by combining the character classification scores. The classifier is trained by one-versus-all strategy so that it gives high similarity to the target class and low scores to the others. Using character classification-based word similarity also helps overcome the out-of-vocabulary (OOV) problem. We use a character-synchronous dynamic search algorithm to efficiently spot the query word in large database. The retrieval performance is further improved by using competing character confusion and writer-adaptive thresholds. Our experimental results on a large handwriting database CASIA-OLHWDB justify the superiority of one-versus-all trained classifiers and the benefits of confidence transformation, character confusion and adaptive thresholds. Particularly, a one-versus-all trained prototype classifier performs as well as a linear support vector machine (SVM) classifier, but consumes much less storage of index file. The experimental comparison with keyword spotting based on handwritten text recognition also demonstrates the effectiveness of the proposed method.


Sign in / Sign up

Export Citation Format

Share Document