Scale Space Co-Occurrence HOG Features for Word Spotting in Handwritten Document Images

C. Thontadari; C. J. Prabhakar

doi:10.4018/ijcvip.2016070105

International Journal of Computer Vision and Image Processing ◽

10.4018/ijcvip.2016070105 ◽

2016 ◽

Vol 6 (2) ◽

pp. 71-86 ◽

Cited By ~ 4

Author(s):

C. Thontadari ◽

C. J. Prabhakar

Keyword(s):

Spatial Information ◽

Scale Parameter ◽

Poor Performance ◽

Scale Space ◽

Feature Descriptor ◽

Word Spotting ◽

Handwritten Documents ◽

Histograms Of Oriented Gradients ◽

The Poor ◽

Handwritten Document

In this paper, the authors propose a Scale Space Co-occurrence Histograms of Oriented Gradients (SS Co-HOG) method for retrieving words from digitized handwritten documents. HOG-based word spotting performs poorly on handwritten documents because HOG ignores the spatial information of neighboring pixels, whereas Co-HOG captures it by counting the co-occurrence of the gradient orientations of two or more neighboring pixels. The authors represent each word image at three scale parameters; at each scale, they divide the word image into blocks, extract Co-HOG features from each block, and concatenate them to form a feature descriptor. The method is evaluated with precision and recall metrics in experiments on the popular IAM and GW datasets, and the authors report that it outperforms the compared methods on both datasets.
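The pipeline described in this abstract (three scales, blocks per scale, Co-HOG per block, concatenation) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the Gaussian smoothing used to build the scale space, the sigma values, the 2 x 4 block grid, the 8 orientation bins, and the 4 pixel offsets are all assumptions chosen for the example, and every function name is hypothetical.

```python
import numpy as np

def gaussian_blur(img, sigma):
    """Separable Gaussian smoothing (assumed way of building one scale)."""
    radius = max(1, int(3 * sigma))
    x = np.arange(-radius, radius + 1)
    kernel = np.exp(-x ** 2 / (2 * sigma ** 2))
    kernel /= kernel.sum()
    padded = np.pad(np.asarray(img, dtype=float), radius, mode="edge")
    rows = np.apply_along_axis(lambda r: np.convolve(r, kernel, mode="valid"), 1, padded)
    return np.apply_along_axis(lambda c: np.convolve(c, kernel, mode="valid"), 0, rows)

def gradient_orientation_bins(img, n_bins=8):
    """Quantize each pixel's gradient orientation into n_bins bins."""
    gy, gx = np.gradient(np.asarray(img, dtype=float))
    angle = np.arctan2(gy, gx) % (2 * np.pi)
    bins = np.floor(angle / (2 * np.pi) * n_bins).astype(int)
    return np.minimum(bins, n_bins - 1)  # guard against rounding up to n_bins

def cooccurrence(bins, offset, n_bins=8):
    """Count orientation pairs (pixel, pixel + offset) inside one block."""
    dy, dx = offset
    h, w = bins.shape
    a = bins[max(0, -dy): h - max(0, dy), max(0, -dx): w - max(0, dx)]
    b = bins[max(0, dy): h - max(0, -dy), max(0, dx): w - max(0, -dx)]
    mat = np.zeros((n_bins, n_bins))
    np.add.at(mat, (a.ravel(), b.ravel()), 1)
    return mat

def ss_cohog(img, sigmas=(1.0, 2.0, 4.0), grid=(2, 4), n_bins=8,
             offsets=((0, 1), (1, 0), (1, 1), (1, -1))):
    """Concatenate per-block co-occurrence matrices over all scales."""
    feats = []
    for sigma in sigmas:  # one pass per scale (sigma values are assumptions)
        bins = gradient_orientation_bins(gaussian_blur(img, sigma), n_bins)
        for band in np.array_split(bins, grid[0], axis=0):
            for block in np.array_split(band, grid[1], axis=1):
                for off in offsets:
                    feats.append(cooccurrence(block, off, n_bins).ravel())
    return np.concatenate(feats)
```

With these assumed defaults the descriptor has 3 x 8 x 4 x 64 = 6144 entries (scales x blocks x offsets x orientation pairs).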

Segmentation-Free Word Spotting in Handwritten Documents Using Scale Space Co-HoG Feature Descriptors

Advances in Computational Intelligence and Robotics - Applications of Advanced Machine Intelligence in Computer Vision and Object Recognition ◽

10.4018/978-1-7998-2736-8.ch009 ◽

2020 ◽

pp. 219-247

Author(s):

Prabhakar C. J.

Keyword(s):

Scale Space ◽

Word Segmentation ◽

Literature Survey ◽

Feature Descriptor ◽

Word Spotting ◽

Handwritten Documents ◽

Histograms Of Oriented Gradients ◽

Feature Descriptors ◽

Increase In Accuracy ◽

Free Word

In this chapter, the author presents a segmentation-free word spotting method for handwritten documents using a Scale Space co-occurrence histograms of oriented gradients (Co-HOG) feature descriptor. The chapter begins with an introduction to word spotting, its challenges, and its applications, followed by a review of existing techniques for word spotting in handwritten documents. The literature survey reveals that segmentation-based word spotting methods usually need a layout analysis step for word segmentation, and that any segmentation errors can affect the subsequent word representation and matching steps. To overcome these drawbacks, the author proposes segmentation-free word spotting using the Scale Space Co-HOG feature descriptor. The method is evaluated using mean Average Precision (mAP) in experiments on popular datasets such as GW and IAM. Its performance is compared with existing state-of-the-art segmentation-based and segmentation-free methods, and the author reports a considerable increase in accuracy.
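For readers unfamiliar with the mAP metric mentioned in this abstract, the following is a small sketch of how it is commonly computed for ranked retrieval. It assumes each query yields a ranked list of binary relevance flags (1 = the retrieved word matches the query); the function names and the optional n_relevant parameter are illustrative, not taken from the chapter.

```python
def average_precision(ranked_relevance, n_relevant=None):
    """Average of precision values taken at each rank where a hit occurs.

    n_relevant is the total number of relevant items for the query; if it
    is omitted, the number of hits in the ranked list is used instead.
    """
    hits = 0
    precisions = []
    for rank, relevant in enumerate(ranked_relevance, start=1):
        if relevant:
            hits += 1
            precisions.append(hits / rank)
    total = n_relevant if n_relevant is not None else hits
    return sum(precisions) / total if total else 0.0

def mean_average_precision(runs):
    """Mean of the per-query average precision over all queries."""
    return sum(average_precision(r) for r in runs) / len(runs)
```

For a ranked list [1, 0, 1, 0], hits occur at ranks 1 and 3, so the average precision is (1/1 + 2/3) / 2.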

Segmentation Free Word Spotting for Handwritten Documents Using Bag of Visual Words Based on Co-HOG Descriptor

International Journal of Information Retrieval Research ◽

10.4018/ijirr.2019040105 ◽

2019 ◽

Vol 9 (2) ◽

pp. 49-65

Author(s):

Thontadari C. ◽

Prabhakar C. J.

Keyword(s):

Visual Information ◽

Spatial Information ◽

Spatial Location ◽

Visual Word ◽

Bag Of Visual Words ◽

Word Spotting ◽

Handwritten Documents ◽

Visual Words ◽

Handwritten Document ◽

Free Word

In this article, the authors propose a segmentation-free word spotting method for handwritten document images using a Bag of Visual Words (BoVW) framework based on the co-occurrence histogram of oriented gradients (Co-HOG) descriptor. Initially, the handwritten document is represented by visual word vectors obtained from the frequency of occurrence of Co-HOG descriptors within local patches of the document. This representation does not encode the spatial location of the visual words, yet spatial information helps to distinguish locations that appear identical from visual information alone. Hence, to add the spatial distribution of visual words to the unstructured BoVW framework, the authors adopt the spatial pyramid matching (SPM) technique. The method is evaluated on popular datasets, and the authors report that it outperforms existing segmentation-free word spotting techniques.
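The combination of BoVW and a spatial pyramid described here can be illustrated with a short sketch. It assumes that local descriptors and a codebook are already available as NumPy arrays and that each descriptor has an (x, y) position; the unweighted concatenation of pyramid levels is a simplification of SPM, and all names and parameters are hypothetical rather than the authors' own.

```python
import numpy as np

def assign_visual_words(descriptors, codebook):
    """Map each local descriptor to the index of its nearest codebook entry."""
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def spatial_pyramid_histogram(words, xy, size, vocab_size, levels=2):
    """Concatenate visual-word histograms over a 1x1, 2x2, 4x4, ... grid.

    words: visual word index per descriptor
    xy:    (x, y) position per descriptor
    size:  (width, height) of the region being described
    Level weights used in full SPM are omitted here for brevity.
    """
    width, height = size
    feats = []
    for level in range(levels + 1):
        cells = 2 ** level
        cx = np.minimum((xy[:, 0] / width * cells).astype(int), cells - 1)
        cy = np.minimum((xy[:, 1] / height * cells).astype(int), cells - 1)
        flat = (cy * cells + cx) * vocab_size + words
        feats.append(np.bincount(flat, minlength=cells * cells * vocab_size))
    hist = np.concatenate(feats).astype(float)
    return hist / max(hist.sum(), 1.0)  # L1-normalize the final vector
```

Level 0 alone is the plain, location-free BoVW histogram; the finer levels are what add the spatial distribution of the visual words.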

Bag of Visual Words Based on Co-HOG Features for Word Spotting in Handwritten Documents

Advancements in Computer Vision and Image Processing - Advances in Computer and Electrical Engineering ◽

10.4018/978-1-5225-5628-2.ch007 ◽

2018 ◽

pp. 162-189

Author(s):

Thontadari C. ◽

Prabhakar C. J.

Keyword(s):

Spatial Information ◽

Bag Of Visual Words ◽

Shape Information ◽

Word Spotting ◽

Handwritten Documents ◽

Visual Words ◽

Gradient Orientation ◽

Handwritten Document ◽

Image Shape ◽

Pyramid Matching

In this chapter, the authors present a segmentation-based word spotting method for handwritten documents using a bag of visual words (BoVW) framework based on co-occurrence histograms of oriented gradients (Co-HOG) features. The Co-HOG descriptor captures the shape information of the word image and encodes local spatial information by counting the co-occurrence of gradient orientations of neighboring pixel pairs. The handwritten document images are segmented into words, and each word image is represented by a vector containing the frequency of the visual words that appear in the image. To incorporate spatial information into the BoVW framework, the authors adopt the spatial pyramid matching (SPM) method. The method is evaluated with precision and recall metrics in experiments on popular datasets such as GW and IAM. The performance analysis confirms that the method outperforms existing word spotting techniques.
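The precision and recall metrics used in this evaluation can be sketched as below for a single query. The example assumes the retrieved and relevant word images are identified by hashable IDs; the function name and the IDs in the usage note are purely illustrative.

```python
def precision_recall(retrieved, relevant):
    """Precision and recall of one retrieval result.

    retrieved: IDs of the word images returned for the query
    relevant:  IDs of the word images that truly match the query
    """
    retrieved, relevant = set(retrieved), set(relevant)
    true_positives = len(retrieved & relevant)
    precision = true_positives / len(retrieved) if retrieved else 0.0
    recall = true_positives / len(relevant) if relevant else 0.0
    return precision, recall
```

For instance, retrieving four word images of which two are among three relevant ones gives a precision of 2/4 and a recall of 2/3.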

Word Spotting Based on Bispace Similarity for Visual Information Retrieval in Handwritten Document Images

International Journal of Computer Vision and Image Processing ◽

10.4018/ijcvip.2019070103 ◽

2019 ◽

Vol 9 (3) ◽

pp. 38-58 ◽

Cited By ~ 1

Author(s):

Ryma Benabdelaziz ◽

Djamel Gaceb ◽

Mohammed Haddad

Keyword(s):

Visual Information ◽

Visual Features ◽

Keyword Spotting ◽

Visual Information Retrieval ◽

Word Spotting ◽

Handwritten Documents ◽

Handwritten Document ◽

Image Gradients ◽

Point Detection ◽

Accurate Matching

Retrieving information from a huge collection of ancient handwritten documents is important for indexing, interpreting, browsing, and searching documents in various domains. Word spotting approaches are widely used in this context but have several limitations related to the complex properties of handwriting. These can appear at several steps: interest point detection, description, and matching. This article proposes a new word spotting approach for word retrieval in handwritten documents, which mainly leverages the properties of image gradients for visual feature detection and description. The approach combines spatial relationships with textural information to achieve more accurate matching. The experimental results demonstrate higher performance on the Jeremy Bentham dataset, evaluated following the recent benchmarks of the ICDAR 2015 Competition on Keyword Spotting for Handwritten Documents.

Word Spotting in Handwritten Document Images based on Multiple Features

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.l2625.1081219 ◽

2019 ◽

Vol 8 (12) ◽

pp. 3527-3537

Keyword(s):

Real Time ◽

Local Binary Pattern ◽

Document Images ◽

Morphological Filters ◽

Multiple Features ◽

Word Spotting ◽

Handwritten Documents ◽

The Real ◽

Handwritten Document ◽

Printed Text

This paper presents word spotting in handwritten documents based on multiple features. The features are derived using Gabor filters, histogram of oriented gradients (HOG), local binary pattern (LBP), texture filters, and morphological filters. Real-world documents are heterogeneous in nature: application forms, postal cards, railway reservation forms, and similar documents contain handwritten and printed text in different scripts. Spotting a word in such documents and retrieving them from a huge digitized repository is a challenging task. To address this, word spotting based on multiple features is carried out with both learning-based and learning-free methods. In both methods, texture filters exhibit outstanding performance in terms of precision, recall, and F-measure. To confirm the capability of the proposed method, extensive experiments are carried out on a publicly available dataset, i.e., GW20, and the results are encouraging compared to other contemporary works.
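Of the features listed in this abstract, the local binary pattern is the simplest to illustrate. The sketch below computes a basic 8-neighbor LBP histogram with NumPy; the neighbor ordering, the greater-than-or-equal comparison, and the 256-bin normalized histogram are common conventions assumed here, not details taken from the paper, and the function name is hypothetical.

```python
import numpy as np

def lbp_histogram(img):
    """Normalized 256-bin histogram of basic 3x3 local binary patterns."""
    img = np.asarray(img, dtype=float)
    h, w = img.shape
    center = img[1:-1, 1:-1]  # border pixels have no full neighborhood
    # 8 neighbors, clockwise from the top-left corner
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    codes = np.zeros(center.shape, dtype=int)
    for bit, (dy, dx) in enumerate(offsets):
        neighbor = img[1 + dy: h - 1 + dy, 1 + dx: w - 1 + dx]
        # set this bit where the neighbor is at least as bright as the center
        codes |= (neighbor >= center).astype(int) << bit
    hist = np.bincount(codes.ravel(), minlength=256).astype(float)
    return hist / max(hist.sum(), 1.0)
```

A histogram like this could be concatenated with other feature vectors (for example HOG) to form a multi-feature descriptor of a word image.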

Partially Specified Actuarial Tables and the Poor Performance of Static-99R

PsycEXTRA Dataset ◽

10.1037/e571212013-372 ◽

2013 ◽

Author(s):

Richard Wollert ◽

Jacqueline Waggoner

Keyword(s):

Poor Performance ◽

The Poor

The Roma in Post-Communist Bulgaria: Growing Social Marginalization and State Policies

Journal of Asian Social Science Research ◽

10.15575/jassr.v2i1.1 ◽

2020 ◽

Vol 2 (1) ◽

pp. 1-24

Author(s):

Yorgos Christidis

Keyword(s):

Political Parties ◽

Social Exclusion ◽

Hate Crime ◽

Hate Speech ◽

Poor Performance ◽

State Policies ◽

Social Marginalization ◽

Popular Support ◽

Economic Problems ◽

The Poor

This article analyzes the growing impoverishment and marginalization of the Roma in Bulgarian society and the evolution of Bulgaria’s post-1989 policies towards the Roma. It examines the results of the policies so far and the reasons behind the “poor performance” of the policies implemented. It is believed that post-communist Bulgaria has successfully re-integrated the ethnic Turkish minority, given both the assimilation campaign carried out against it in the 1980s and the tragic events that took place in ex-Yugoslavia in the 1990s. Bulgaria’s successful “ethnic model”, however, has failed to include the Roma. The “Roma issue” has emerged as one of the most serious and intractable ones facing Bulgaria since 1990. A growing part of its population has been living in circumstances of poverty and marginalization that seem only to deteriorate as years go by. State policies introduced since 1999 have largely failed to produce tangible results and to reverse the socio-economic marginalization of the Roma: discrimination, poverty, and social exclusion continue to be the norm. NGOs point out that many of the measures that have been announced have not been properly implemented, and that existing legislation to tackle discrimination, hate crime, and hate speech is not enforced. Bulgaria’s political parties are averse to dealing with the Roma issue. Policies addressing the socio-economic problems of the Roma, including hate speech and crime, do not enjoy popular support and are seen as politically damaging.

THE REASONS BEHIND THE POOR PERFORMANCE OF SAUDI STUDENTS IN IELTS

i-manager’s Journal on English Language Teaching ◽

10.26634/jelt.9.1.15375 ◽

2019 ◽

Vol 9 (1) ◽

pp. 38

Author(s):

ALZAHRANI MOHSEN

Keyword(s):

Poor Performance ◽

Saudi Students ◽

The Poor

Biomass Characterization in a Nitrification-Denitrification Biological Enhanced Phosphorus Removal (NDBEPR) Plant during Start-Up and Subsequent Periods of Good and Poor Phosphorus Removal

Water Science & Technology ◽

10.2166/wst.1994.0316 ◽

1994 ◽

Vol 29 (7) ◽

pp. 91-100 ◽

Cited By ~ 7

Author(s):

K. C. Lindrea ◽

S. P. Pigdon ◽

B. Boyd ◽

G. A. Lockwood

Keyword(s):

Phosphorus Removal ◽

Poor Performance ◽

Intracellular Distribution ◽

Full Scale ◽

Laboratory Scale ◽

Plant Operation ◽

The Poor ◽

Start Up ◽

Biomass Characterization

During commissioning and process stabilization of an NDBEPR plant at Bendigo, the intracellular distribution and movement of phosphorus, K+, Mg2+ and Ca2+ were followed to establish the nature of biomass development. The system was also monitored at the end of a period of breakdown of the BEPR process and during its return to phosphorus removal. Phosphorus (P) and Mg2+ distribution in the biomass were closely related during all phases of plant operation, and laboratory trials indicated that the poor performance of the full-scale plant was associated with seasonal reduction in influent Mg2+. Laboratory-scale trials produced a similar effect when the influent Mg2+ was limited to concentrations much lower than those experienced in the full-scale plant, but only after the Mg2+ and P reserves in the biomass were depleted. The distribution of P, K+, Mg2+ and Ca2+ in the biomass from the full-scale plant was similar to that seen in the laboratory trials when cations in the feed were severely limited, and recovery of the full-scale plant also closely matched that of the laboratory-scale system.

A voting-based technique for word spotting in handwritten document images

Multimedia Tools and Applications ◽

10.1007/s11042-020-10363-0 ◽

2021 ◽

Author(s):

Shamik Majumder ◽

Subhrangshu Ghosh ◽

Samir Malakar ◽

Ram Sarkar ◽

Mita Nasipuri

Keyword(s):

Document Images ◽

Word Spotting ◽

Handwritten Document
