High accuracy optical character recognition algorithms using learning array of ANN

AbstractIn recent years, $$\hbox {optical character recognition (OCR)}$$ optical character recognition (OCR) systems have been used to digitally preserve historical archives. To transcribe historical archives into a machine-readable form, first, the documents are scanned, then an $$\hbox {OCR}$$ OCR is applied. In order to digitize documents without the need to remove them from where they are archived, it is valuable to have a portable device that combines scanning and $$\hbox {OCR}$$ OCR capabilities. Nowadays, there exist many commercial and open-source document digitization techniques, which are optimized for contemporary documents. However, they fail to give sufficient text recognition accuracy for transcribing historical documents due to the severe quality degradation of such documents. On the contrary, the anyOCR system, which is designed to mainly digitize historical documents, provides high accuracy. However, this comes at a cost of high computational complexity resulting in long runtime and high power consumption. To tackle these challenges, we propose a low power energy-efficient accelerator with real-time capabilities called iDocChip, which is a configurable hybrid hardware-software programmable $$\hbox {System-on-Chip (SoC)}$$ System-on-Chip (SoC) based on anyOCR for digitizing historical documents. In this paper, we focus on one of the most crucial processing steps in the anyOCR system: Text and Image Segmentation, which makes use of a multi-resolution morphology-based algorithm. Moreover, an optimized $$\hbox {FPGA}$$ FPGA -based hybrid architecture of this anyOCR step along with its optimized software implementations are presented. We demonstrate our results on multiple embedded and general-purpose platforms with respect to runtime and power consumption. The resulting hardware accelerator outperforms the existing anyOCR by 6.2$$\times$$ × , while achieving 207$$\times$$ × higher energy-efficiency and maintaining its high accuracy.

Download Full-text

Form design for high accuracy optical character recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence ◽

10.1109/34.506417 ◽

1996 ◽

Vol 18 (6) ◽

pp. 653-656 ◽

Cited By ~ 5

Author(s):

M.D. Garris ◽

D.L. Dimmick

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

High Accuracy ◽

Optical Character ◽

Form Design

Download Full-text

Evaluation of optical character recognition algorithms and feature extraction techniques

2016 Sixth International Conference on Innovative Computing Technology (INTECH) ◽

10.1109/intech.2016.7845112 ◽

2016 ◽

Cited By ~ 3

Author(s):

Syed Hassan Tanvir ◽

Tamim Ahmed Khan ◽

Abu Bakar Yamin

Keyword(s):

Feature Extraction ◽

Character Recognition ◽

Optical Character Recognition ◽

Extraction Techniques ◽

Optical Character ◽

Recognition Algorithms

Download Full-text

High accuracy optical character recognition using neural networks with centroid dithering

IEEE Transactions on Pattern Analysis and Machine Intelligence ◽

10.1109/34.368165 ◽

1995 ◽

Vol 17 (2) ◽

pp. 218-224 ◽

Cited By ~ 44

Author(s):

H.I. Avi-Itzhak ◽

T.A. Diep ◽

H. Garland

Keyword(s):

Neural Networks ◽

Character Recognition ◽

Optical Character Recognition ◽

High Accuracy ◽

Optical Character

Download Full-text

Bubble Captcha - A Start of the New Direction of Text Captcha Scheme Development

MENDEL ◽

10.13164/mendel.2017.1.057 ◽

2017 ◽

Vol 23 (1) ◽

pp. 57-64 ◽

Cited By ~ 1

Author(s):

Ondrej Bostik ◽

Karel Horak ◽

Jan Klecka

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Turing Test ◽

Simple Approach ◽

Internet Services ◽

Malicious Activity ◽

Optical Character ◽

Recognition Algorithms ◽

The World ◽

Human Capability

CAPTCHA, A Completely Automated Public Turing test to tell Computers and Humans Apart, iswell-known system widely used in all sorts of internet services around the world designated to secure the webfrom an automatic malicious activity. For almost two decades almost every system utilize a simple approach tothis problem containing a transcription of distorted letters from image to a text eld. The ground idea is to useimperfection of Optical Character Recognition algorithms against the computers. The development of OpticalCharacter recognition algorithms leads only to state, where the CAPTCHA schemes become more complex andhuman users have a great di culty with the transcription.This paper aims to present a new way of development of CAPTCHA schemes based more a human perception.The goal of this work is to implement new Captcha scheme and assess human capability to read unusual fontsnewer seen before.

Download Full-text

Multi-modal video retrieval using Dilated Pyramidal Residual network

Science and Technology Development Journal - Natural Sciences ◽

10.32508/stdjns.v2i5.789 ◽

2019 ◽

Vol 2 (5) ◽

pp. 138-143

Author(s):

An Ngoc Thuy La ◽

Dat Phuoc Nguyen ◽

Nhut Minh Pham ◽

Quan Hai Vu

Keyword(s):

Automatic Speech Recognition ◽

Character Recognition ◽

Optical Character Recognition ◽

Video Retrieval ◽

High Accuracy ◽

Research Topic ◽

Residual Network ◽

Sequence Recognition ◽

Optical Character ◽

Classification Tasks

Pyramidal Residual Network achieved high accuracy in image classification tasks. However, there is no previous work on sequence recognition tasks using this model. We presented how to extend its architecture to form Dilated Pyramidal Residual Network (DPRN), for this long-standing research topic and evaluate it on the problems of automatic speech recognition and optical character recognition. Together, they formed a multi-modal video retrieval framework for Vietnamese Broadcast News. Experiments were conducted on caption images and speech frames extracted from VTV broadcast videos. Results showed that DPRN was not only end-to-end trainable but also performed well in sequence recognition tasks.

Download Full-text

SELECTION TECHNIQUE FOR MULTIPLE OUTPUTS OF OPTICAL CHARACTER RECOGNITION

Eurasian Journal of Mathematical and Computer Applications ◽

10.32523/2306-6172-2020-8-2-41-51 ◽

2020 ◽

Vol 8 (2) ◽

pp. 41-51

Author(s):

I.Q. Habeeb ◽

Z.Q. Al-Zaydi ◽

H.N. Abdulkhudhur

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Selection Technique ◽

Multiple Outputs ◽

Optical Character

Download Full-text

A Structured Method for the Recognition of Complex Historical Tables

History and Computing ◽

10.3366/hac.1997.9.1-3.58 ◽

1997 ◽

Vol 9 (1-3) ◽

pp. 58-77

Author(s):

Vitaly Kliatskine ◽

Eugene Shchepin ◽

Gunnar Thorvaldsen ◽

Konstantin Zingerman ◽

Valery Lazarev

Keyword(s):

Nineteenth Century ◽

Character Recognition ◽

Optical Character Recognition ◽

Complex Structure ◽

Source Material ◽

Historical Sources ◽

Tax Assessment ◽

Optical Character ◽

Algorithmic Model ◽

Machine Readable

In principle, printed source material should be made machine-readable with systems for Optical Character Recognition, rather than being typed once more. Offthe-shelf commercial OCR programs tend, however, to be inadequate for lists with a complex layout. The tax assessment lists that assess most nineteenth century farms in Norway, constitute one example among a series of valuable sources which can only be interpreted successfully with specially designed OCR software. This paper considers the problems involved in the recognition of material with a complex table structure, outlining a new algorithmic model based on ‘linked hierarchies’. Within the scope of this model, a variety of tables and layouts can be described and recognized. The ‘linked hierarchies’ model has been implemented in the ‘CRIPT’ OCR software system, which successfully reads tables with a complex structure from several different historical sources.

Download Full-text

CNN-based Rain Reduction in Street View Images

London Imaging Meeting ◽

10.2352/issn.2694-118x.2020.lim-12 ◽

2020 ◽

Vol 2020 (1) ◽

pp. 78-81

Author(s):

Simone Zini ◽

Simone Bianco ◽

Raimondo Schettini

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

State Of The Art ◽

Weather Conditions ◽

Specific Interest ◽

Optical Character ◽

Street View ◽

In The Wild ◽

Bad Weather ◽

Detection And Recognition

Rain removal from pictures taken under bad weather conditions is a challenging task that aims to improve the overall quality and visibility of a scene. The enhanced images usually constitute the input for subsequent Computer Vision tasks such as detection and classification. In this paper, we present a Convolutional Neural Network, based on the Pix2Pix model, for rain streaks removal from images, with specific interest in evaluating the results of the processing operation with respect to the Optical Character Recognition (OCR) task. In particular, we present a way to generate a rainy version of the Street View Text Dataset (R-SVTD) for "text detection and recognition" evaluation in bad weather conditions. Experimental results on this dataset show that our model is able to outperform the state of the art in terms of two commonly used image quality metrics, and that it is capable to improve the performances of an OCR model to detect and recognise text in the wild.

Download Full-text

High accuracy optical character recognition algorithms using learning array of ANN

A new approach for the failure mode analysis of optical character recognition algorithms

iDocChip: A Configurable Hardware Architecture for Historical Document Image Processing

Form design for high accuracy optical character recognition

Evaluation of optical character recognition algorithms and feature extraction techniques

High accuracy optical character recognition using neural networks with centroid dithering

Bubble Captcha - A Start of the New Direction of Text Captcha Scheme Development

Multi-modal video retrieval using Dilated Pyramidal Residual network

SELECTION TECHNIQUE FOR MULTIPLE OUTPUTS OF OPTICAL CHARACTER RECOGNITION

A Structured Method for the Recognition of Complex Historical Tables

CNN-based Rain Reduction in Street View Images

Export Citation Format