Image Enhancement of Complex Document Images Using Histogram of Gradient Features

Enhancement of document images is an active research challenge in character recognition. A document with a uniform illumination gradient is important for achieving high recognition accuracy in a document processing system such as Optical Character Recognition (OCR). Complex document images are among the image categories that are more difficult to process than other types of images. The quality of the document determines the precision of a character recognition system, so transforming complex document images to a uniform illumination gradient is needed. In the proposed research, ancient document images from the UMIACS Tobacco 800 database are considered for removal of marginal noise. The proposed technique interprets the document contents block by block to remove the marginal noise that is usually present at the borders of images. Hu moment features are then computed to detect marginal noise in every block. An empirical analysis is carried out to classify blocks as noisy or non-noisy, and the outcomes produced by the algorithm are satisfactory and feasible for subsequent analysis.
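
The block-wise use of Hu moments described in the abstract can be illustrated with a minimal sketch. It assumes a grayscale image stored as nested lists in which ink pixels carry high values, computes only the first two Hu invariants per block, and flags a block as noisy when the second invariant (a rough measure of elongation) exceeds a threshold. The function names, the block size, and the threshold rule are hypothetical; the abstract does not state the actual features or classification rule used.

```python
from typing import List, Tuple

Image = List[List[float]]  # grayscale image; assumed convention: ink = high value


def split_into_blocks(img: Image, block: int) -> List[Tuple[int, int, Image]]:
    """Split img into non-overlapping tiles (edge tiles may be smaller)."""
    tiles = []
    for r in range(0, len(img), block):
        for c in range(0, len(img[0]), block):
            tile = [row[c:c + block] for row in img[r:r + block]]
            tiles.append((r, c, tile))
    return tiles


def hu_first_two(tile: Image) -> Tuple[float, float]:
    """Return the first two Hu moment invariants of a tile."""
    m00 = sum(sum(row) for row in tile)
    if m00 == 0:
        return 0.0, 0.0  # empty tile: no mass, no shape
    xbar = sum(x * v for row in tile for x, v in enumerate(row)) / m00
    ybar = sum(y * v for y, row in enumerate(tile) for v in row) / m00

    def mu(p: int, q: int) -> float:
        # Central moment of order (p, q).
        return sum(((x - xbar) ** p) * ((y - ybar) ** q) * v
                   for y, row in enumerate(tile) for x, v in enumerate(row))

    def eta(p: int, q: int) -> float:
        # Scale-normalized central moment.
        return mu(p, q) / (m00 ** (1 + (p + q) / 2))

    n20, n02, n11 = eta(2, 0), eta(0, 2), eta(1, 1)
    h1 = n20 + n02
    h2 = (n20 - n02) ** 2 + 4 * n11 ** 2
    return h1, h2


def classify_blocks(img: Image, block: int,
                    threshold: float) -> List[Tuple[int, int, bool]]:
    """Flag each block as noisy when h2 exceeds a hypothetical threshold."""
    return [(r, c, hu_first_two(tile)[1] > threshold)
            for r, c, tile in split_into_blocks(img, block)]
```

In this sketch a thin strip along a block edge, which is how marginal noise often appears, yields a larger second invariant than a compact blob. A practical threshold would have to be chosen empirically on labeled blocks.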

10 mW CMOS retina and classifier for handheld, 1000 images/s optical character recognition system

1999 IEEE International Solid-State Circuits Conference. Digest of Technical Papers. ISSCC. First Edition (Cat. No.99CH36278)

DOI: 10.1109/isscc.1999.759194

2003

Author(s): P. Masa, P. Heim, E. Franzi, X. Arreguit, F. Heitger, ...

Keyword(s): Character Recognition, Optical Character Recognition, Recognition System, Optical Character

Real‐time optical character recognition on field programmable gate array for automatic number plate recognition system

IET Circuits Devices & Systems

DOI: 10.1049/iet-cds.2012.0339

2013, Vol 7 (6), pp. 337-344

Cited By ~ 20

Author(s): Xiaojun Zhai, Faycal Bensaali, Reza Sotudeh

Keyword(s): Real Time, Field Programmable Gate Array, Character Recognition, Optical Character Recognition, Recognition System, Optical Character, Field Programmable, Gate Array

Optical character recognition system based on a novel fuzzy descriptive features

Proceedings 7th International Conference on Signal Processing, 2004. Proceedings. ICSP '04. 2004.

DOI: 10.1109/icosp.2004.1441471

2005

Author(s): Y. Alginahi, I. El-Feghi, M. Ahmadi, M.A. Sid-Ahmed

Keyword(s): Character Recognition, Optical Character Recognition, Recognition System, Optical Character

Developing Automated Optical Character Recognition System Using Machine Learning Algorithm to Solve Payment Verification Issues

DOI: 10.1109/icoris52787.2021.9649514

2021

Author(s): Michael Siek, Rafi Soeharto

Keyword(s): Machine Learning, Character Recognition, Optical Character Recognition, Learning Algorithm, Recognition System, Machine Learning Algorithm, Optical Character

Kurzweil Reading Machine: A Partial Evaluation of Its Optical Character Recognition Error Rate

Journal of Visual Impairment & Blindness

DOI: 10.1177/0145482x7907301002

1979, Vol 73 (10), pp. 389-399

Author(s): Gregory L. Goodrich, Richard R. Bennett, William R. De L'aune, Harvey Lauer, Leonard Mowinski

Keyword(s): Error Rate, Character Recognition, Optical Character Recognition, Partial Evaluation, Error Rates, Recognition Error, Optical Character, Printed Materials, High Level

This study was designed to assess the Kurzweil Reading Machine's ability to read three different type styles produced by five different means. The results indicate that the Kurzweil Reading Machines tested have different error rates depending upon the means of producing the copy and upon the type style used; there was a significant interaction between copy method and type style. The interaction indicates that some type styles are better read when the copy is made by one means rather than another. Error rates varied between less than one percent and more than twenty percent. In general, the user will find that high quality printed materials will be read with a relatively high level of accuracy, but as the quality of the material decreases, the number of errors made by the machine increases. As the error rate increases, the user will find it increasingly difficult to understand the spoken output.

Multi-Oriented Text Extraction in Stylistic Documents

International Journal of Image and Graphics

DOI: 10.1142/s0219467815500023

2015, Vol 15 (01), pp. 1550002

Author(s): Brij Mohan Singh, Rahul Sharma, Debashis Ghosh, Ankush Mittal

Keyword(s): Character Recognition, Optical Character Recognition, Morphological Operations, Document Images, Font Size, Text Extraction, Optical Character, Engineering Drawings, Flood Fill

In many documents, such as maps, engineering drawings, and artistic documents, there are printed as well as handwritten materials in which text regions and text-lines are not parallel to each other or are curved, and which contain varied text: different font sizes, text and non-text areas lying close to each other, and non-straight, skewed, and warped text-lines. Commercially available optical character recognition (OCR) systems, such as ABBYY FineReader and Free OCR, cannot handle the range of stylistic document images containing curved, multi-oriented, and stylish-font text-lines. Extracting individual text-lines and words from these documents is generally not straightforward. Most reported segmentation work addresses simple documents, and it remains highly challenging to implement an OCR that works under all possible conditions and gives highly accurate results, especially for stylistic documents. This paper presents an approach based on dilation and flood fill morphological operations that extracts multi-oriented text-lines and words from complex-layout or stylistic document images in subsequent stages. The segmentation results obtained from our method prove to be superior to those of the standard profiling-based method.
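
The combination of dilation and flood fill mentioned in the abstract can be sketched as follows. This is a minimal illustration, not the authors' method: it assumes a binary image stored as nested lists (1 = ink), uses a rectangular structuring element whose size (`reach_x`, `reach_y`) is a hypothetical parameter, and does nothing specific for curved or multi-oriented lines. Dilation merges nearby characters into blobs, and flood fill then labels each blob and reports its bounding box.

```python
from collections import deque
from typing import List, Tuple

Grid = List[List[int]]  # binary image: 1 = ink, 0 = background
Box = Tuple[int, int, int, int]  # (top, left, bottom, right), inclusive


def dilate(img: Grid, reach_x: int = 1, reach_y: int = 0) -> Grid:
    """Binary dilation with a (2*reach_y+1) x (2*reach_x+1) rectangle."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if img[y][x]:
                for yy in range(max(0, y - reach_y), min(h, y + reach_y + 1)):
                    for xx in range(max(0, x - reach_x), min(w, x + reach_x + 1)):
                        out[yy][xx] = 1
    return out


def flood_fill_regions(img: Grid) -> List[Box]:
    """Label 8-connected regions by flood fill and return their bounding boxes."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for sy in range(h):
        for sx in range(w):
            if not img[sy][sx] or seen[sy][sx]:
                continue
            seen[sy][sx] = True
            queue = deque([(sy, sx)])
            top, left, bottom, right = sy, sx, sy, sx
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and img[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def extract_words(img: Grid, reach_x: int = 1) -> List[Box]:
    """Group characters into word-like regions: dilate, then flood fill."""
    return flood_fill_regions(dilate(img, reach_x=reach_x))
```

A small `reach_x` tends to group characters into words, while a larger value merges words into text-lines. The returned boxes are in the coordinates of the dilated image, so they extend past the original ink by up to the reach.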
