Speeding-Up Graph-Based Keyword Spotting in Historical Handwritten Documents

Filters for graph-based keyword spotting in historical handwritten documents

Pattern Recognition Letters ◽

10.1016/j.patrec.2018.03.030 ◽

2020 ◽

Vol 134 ◽

pp. 125-134 ◽

Cited By ~ 3

Author(s):

Michael Stauffer ◽

Andreas Fischer ◽

Kaspar Riesen

Keyword(s):

Keyword Spotting ◽

Handwritten Documents

Download Full-text

Ensembles for Graph-Based Keyword Spotting in Historical Handwritten Documents

2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) ◽

10.1109/icdar.2017.122 ◽

2017 ◽

Cited By ~ 1

Author(s):

Michael Stauffer ◽

Andreas Fischer ◽

Kaspar Riesen

Keyword(s):

Keyword Spotting ◽

Handwritten Documents

Download Full-text

Bayesian background models for keyword spotting in handwritten documents

Pattern Recognition ◽

10.1016/j.patcog.2016.06.030 ◽

2017 ◽

Vol 64 ◽

pp. 84-91 ◽

Cited By ~ 6

Author(s):

Gaurav Kumar ◽

Venu Govindaraju

Keyword(s):

Keyword Spotting ◽

Handwritten Documents ◽

Background Models

Download Full-text

Keyword Spotting from Online Chinese Handwritten Documents Using One-vs-All Trained Character Classifier

2010 12th International Conference on Frontiers in Handwriting Recognition ◽

10.1109/icfhr.2010.49 ◽

2010 ◽

Cited By ~ 10

Author(s):

Heng Zhang ◽

Da-Han Wang ◽

Cheng-Lin Liu

Keyword(s):

Keyword Spotting ◽

Handwritten Documents

Download Full-text

Keyword Spotting in Offline Chinese Handwritten Documents Using a Statistical Model

2011 International Conference on Document Analysis and Recognition ◽

10.1109/icdar.2011.25 ◽

2011 ◽

Cited By ~ 8

Author(s):

Liang Huang ◽

Fei Yin ◽

Qing-Hu Chen ◽

Cheng-Lin Liu

Keyword(s):

Statistical Model ◽

Keyword Spotting ◽

Handwritten Documents

Download Full-text

Keyword Spotting in Online Chinese Handwritten Documents with Candidate Scoring Based on Semi-CRF Model

2013 12th International Conference on Document Analysis and Recognition ◽

10.1109/icdar.2013.118 ◽

2013 ◽

Cited By ~ 4

Author(s):

Heng Zhang ◽

Xiang-Dong Zhou ◽

Cheng-Lin Liu

Keyword(s):

Keyword Spotting ◽

Handwritten Documents

Download Full-text

Development of a Two-Stage Segmentation-Based Word Searching Method for Handwritten Document Images

Journal of Intelligent Systems ◽

10.1515/jisys-2017-0384 ◽

2018 ◽

Vol 29 (1) ◽

pp. 719-735 ◽

Cited By ~ 2

Author(s):

Samir Malakar ◽

Manosij Ghosh ◽

Ram Sarkar ◽

Mita Nasipuri

Keyword(s):

Feature Vector ◽

Binary Classification ◽

Research Problem ◽

Document Image ◽

Feature Descriptor ◽

Keyword Spotting ◽

Two Stage ◽

Handwritten Documents ◽

And Gender ◽

Searching Method

Abstract Word searching or keyword spotting is an important research problem in the domain of document image processing. The solution to the said problem for handwritten documents is more challenging than for printed ones. In this work, a two-stage word searching schema is introduced. In the first stage, all the irrelevant words with respect to a search word are filtered out from the document page image. This is carried out using a zonal feature vector, called pre-selection feature vector, along with a rule-based binary classification method. In the next step, a holistic word recognition paradigm is used to confirm a pre-selected word as search word. To accomplish this, a modified histogram of oriented gradients-based feature descriptor is combined with a topological feature vector. This method is experimented on a QUWI English database, which is freely available through the International Conference on Document Analysis and Recognition 2015 competition entitled “Writer Identification and Gender Classification.” This technique not only provides good retrieval performance in terms of recall, precision, and F-measure scores, but it also outperforms some state-of-the-art methods.

Download Full-text

Hybrid HMM/BLSTM system for multi-script keyword spotting in printed and handwritten documents with identification stage

Neural Computing and Applications ◽

10.1007/s00521-019-04429-w ◽

2019 ◽

Vol 32 (13) ◽

pp. 9201-9215

Author(s):

Ahmed Cheikhrouhou ◽

Yousri Kessentini ◽

Slim Kanoun

Keyword(s):

Keyword Spotting ◽

Handwritten Documents ◽

Identification Stage

Download Full-text

Two-stage approach to keyword spotting in handwritten documents

10.1117/12.2042265 ◽

2013 ◽

Author(s):

Mehdi Haji ◽

Mohammad R. Ameri ◽

Tien D. Bui ◽

Ching Y. Suen ◽

Dominique Ponson

Keyword(s):

Keyword Spotting ◽

Two Stage ◽

Handwritten Documents

Download Full-text

KEYWORD SPOTTING FROM ONLINE CHINESE HANDWRITTEN DOCUMENTS USING ONE-VERSUS-ALL CHARACTER CLASSIFICATION MODEL

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001413530017 ◽

2013 ◽

Vol 27 (03) ◽

pp. 1353001 ◽

Cited By ~ 3

Author(s):

HENG ZHANG ◽

DA-HAN WANG ◽

CHENG-LIN LIU ◽

HORST BUNKE

Keyword(s):

Classification Model ◽

Experimental Comparison ◽

Support Vector ◽

Svm Classifier ◽

Keyword Spotting ◽

Handwritten Documents ◽

Handwritten Text Recognition ◽

Adaptive Thresholds ◽

Query Word ◽

Character Classification

In this paper, we propose a method for text-query-based keyword spotting from online Chinese handwritten documents using character classification model. The similarity between the query word and handwriting is obtained by combining the character classification scores. The classifier is trained by one-versus-all strategy so that it gives high similarity to the target class and low scores to the others. Using character classification-based word similarity also helps overcome the out-of-vocabulary (OOV) problem. We use a character-synchronous dynamic search algorithm to efficiently spot the query word in large database. The retrieval performance is further improved by using competing character confusion and writer-adaptive thresholds. Our experimental results on a large handwriting database CASIA-OLHWDB justify the superiority of one-versus-all trained classifiers and the benefits of confidence transformation, character confusion and adaptive thresholds. Particularly, a one-versus-all trained prototype classifier performs as well as a linear support vector machine (SVM) classifier, but consumes much less storage of index file. The experimental comparison with keyword spotting based on handwritten text recognition also demonstrates the effectiveness of the proposed method.

Download Full-text