CNN-based Rain Reduction in Street View Images

2020 ◽  
Vol 2020 (1) ◽  
pp. 78-81
Author(s):  
Simone Zini ◽  
Simone Bianco ◽  
Raimondo Schettini

Rain removal from pictures taken under bad weather conditions is a challenging task that aims to improve the overall quality and visibility of a scene. The enhanced images usually constitute the input for subsequent Computer Vision tasks such as detection and classification. In this paper, we present a Convolutional Neural Network, based on the Pix2Pix model, for rain streak removal from images, with specific interest in evaluating the results of the processing operation with respect to the Optical Character Recognition (OCR) task. In particular, we present a way to generate a rainy version of the Street View Text Dataset (R-SVTD) for "text detection and recognition" evaluation in bad weather conditions. Experimental results on this dataset show that our model is able to outperform the state of the art in terms of two commonly used image quality metrics, and that it is capable of improving the performance of an OCR model in detecting and recognising text in the wild.
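The abstract does not specify how the rainy SVT images were synthesized; the following Python sketch shows one common way to render synthetic rain streaks onto clean images (a sparse random droplet mask smeared with a motion-blur kernel, then additively blended). The density, streak length, angle, and brightness values are illustrative assumptions, not the authors' settings.

```python
import cv2
import numpy as np

def add_rain(image_bgr, density=0.002, streak_length=15, angle_deg=70, brightness=0.8):
    """Blend synthetic rain streaks onto a colour image (illustrative parameters only)."""
    h, w = image_bgr.shape[:2]
    # Sparse random droplets that will be smeared into streaks.
    mask = (np.random.rand(h, w) < density).astype(np.float32)
    # Motion-blur kernel oriented along the desired streak angle.
    kernel = np.zeros((streak_length, streak_length), dtype=np.float32)
    kernel[streak_length // 2, :] = 1.0
    rot = cv2.getRotationMatrix2D((streak_length / 2, streak_length / 2), angle_deg, 1.0)
    kernel = cv2.warpAffine(kernel, rot, (streak_length, streak_length))
    kernel /= kernel.sum()
    streaks = cv2.filter2D(mask, -1, kernel)
    streaks = np.clip(streaks * streak_length * brightness, 0.0, 1.0)
    # Additive blend of the white streak layer over the clean image.
    rainy = image_bgr.astype(np.float32) / 255.0 + streaks[..., None]
    return (np.clip(rainy, 0.0, 1.0) * 255).astype(np.uint8)

# Hypothetical usage: rainy = add_rain(cv2.imread("svt_sample.jpg"))
```

Pairs of clean and synthetically degraded images produced in some such way are what an image-to-image model such as Pix2Pix needs for supervised training.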

Author(s):  
Rashmi Welekar ◽  
Nileshsingh V. Thakur

The world started to talk about optical character recognition (OCR) around 1870, and over the following 25 years OCR systems were designed for industrial applications. Today, OCR software is easily available online for free through products such as Acrobat Reader, WebOCR, etc. Yet the research continues. Do we need to switch direction or introduce new hypotheses? These are some of the key questions. The purpose of this chapter is to answer these questions and to propose new methods for character recognition.


Author(s):  
S. IMPEDOVO ◽  
L. OTTAVIANO ◽  
S. OCCHINEGRO

In order to highlight the interesting problems and current results on the state of the art in optical character recognition (OCR), this paper describes and compares the preprocessing, feature extraction and postprocessing techniques used in commercial reading machines. Problems related to handwritten and printed character recognition are pointed out, and the functions and operations of the major components of an OCR system are described. Historical background on the development of character recognition is briefly given, and the working of an optical scanner is explained. The specifications of several commercially available recognition systems are reported and compared.
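As a hedged illustration of the major OCR components the paper surveys (preprocessing, segmentation, feature extraction/classification, and postprocessing), the Python sketch below chains them together; the concrete operations chosen here (Otsu binarization, connected-component segmentation, a caller-supplied classifier, an optional lexicon check) are assumptions for illustration, not the commercial systems reviewed.

```python
import cv2

def ocr_pipeline(page_bgr, classify_glyph, lexicon=None):
    """Skeleton of the classic OCR stages (illustrative only)."""
    # Preprocessing: grayscale conversion and Otsu binarization.
    gray = cv2.cvtColor(page_bgr, cv2.COLOR_BGR2GRAY)
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)
    # Segmentation: connected components as candidate characters.
    n, labels, stats, _ = cv2.connectedComponentsWithStats(binary)
    recognized = []
    for i in range(1, n):                      # label 0 is the background
        x, y, w, h, area = stats[i]
        if area < 20:                          # assumed noise threshold
            continue
        glyph = binary[y:y + h, x:x + w]
        # Feature extraction and classification delegated to the supplied classifier.
        recognized.append((x, classify_glyph(glyph)))
    # Postprocessing: left-to-right ordering and (optionally) a lexicon check.
    text = "".join(c for _, c in sorted(recognized))
    if lexicon is not None and text not in lexicon:
        pass                                   # a real system would correct against the lexicon here
    return text
```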


2020 ◽  
Vol 10 (3) ◽  
pp. 1117 ◽  
Author(s):  
Birhanu Belay ◽  
Tewodros Habtegebrial ◽  
Million Meshesha ◽  
Marcus Liwicki ◽  
Gebeyehu Belay ◽  
...  

In this paper, we introduce an end-to-end Amharic text-line image recognition approach based on recurrent neural networks. Amharic is an indigenous Ethiopic script which follows a unique syllabic writing system adopted from the ancient Geez script. The script uses 34 consonant characters, each with seven vowel variants (together called basic characters), and other labialized characters derived by adding diacritical marks to and/or removing parts of the basic characters. The diacritics attached to the basic characters are relatively small, which makes the derived characters visually similar and challenging to distinguish. Motivated by the recent success of end-to-end learning in pattern recognition, we propose a model which integrates a feature extractor, sequence learner, and transcriber in a unified module and is then trained in an end-to-end fashion. Experimental results on the printed and synthetic subsets of the benchmark Amharic Optical Character Recognition (OCR) database ADOCR demonstrate that the proposed model outperforms state-of-the-art methods by 6.98% and 1.05%, respectively.
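As a hedged illustration of the unified feature extractor, sequence learner, and transcriber, the PyTorch sketch below assembles a small CRNN trained with CTC loss; the layer sizes, the assumed alphabet of roughly 300 symbols, and the framework choice are illustrative assumptions, not the authors' exact configuration.

```python
import torch
import torch.nn as nn

class CRNN(nn.Module):
    """CNN feature extractor + BiLSTM sequence learner + linear/CTC transcriber (illustrative sizes)."""
    def __init__(self, num_classes, img_height=48):
        super().__init__()
        self.features = nn.Sequential(                      # feature extractor
            nn.Conv2d(1, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(64, 128, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        self.sequence = nn.LSTM(128 * (img_height // 4), 256,
                                bidirectional=True, batch_first=True)
        self.transcriber = nn.Linear(512, num_classes)      # classes include the CTC blank

    def forward(self, x):                                   # x: (batch, 1, H, W) grayscale text lines
        f = self.features(x)                                # (batch, C, H/4, W/4)
        b, c, h, w = f.shape
        f = f.permute(0, 3, 1, 2).reshape(b, w, c * h)      # one feature vector per horizontal step
        s, _ = self.sequence(f)
        return self.transcriber(s).log_softmax(-1)          # (batch, W/4, num_classes)

# One training step with CTC loss on dummy data; the alphabet size of 300 is an assumption.
model = CRNN(num_classes=300)
ctc = nn.CTCLoss(blank=0, zero_infinity=True)
lines = torch.randn(4, 1, 48, 256)                          # dummy batch of text-line images
targets = torch.randint(1, 300, (4, 20))                    # dummy label sequences (blank excluded)
log_probs = model(lines).permute(1, 0, 2)                   # CTCLoss expects (T, N, C)
input_lengths = torch.full((4,), log_probs.size(0), dtype=torch.long)
target_lengths = torch.full((4,), 20, dtype=torch.long)
loss = ctc(log_probs, targets, input_lengths, target_lengths)
loss.backward()
```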


2021 ◽  
Vol 14 (4) ◽  
pp. 11
Author(s):  
Kayode David Adedayo ◽  
Ayomide Oluwaseyi Agunloye

License plate detection and recognition are critical components in the development of a connected intelligent transportation system, but they are underused in developing countries because of the associated costs. Existing license plate detection and recognition systems with high accuracy require Graphical Processing Units (GPUs), which may be difficult to come by in developing nations. Single-stage detectors and commercial optical character recognition engines, on the other hand, are less computationally expensive and can achieve acceptable detection and recognition accuracy without the use of a GPU. In this work, a pretrained SSD model and a Tesseract tessdata-fast traineddata model were fine-tuned on a dataset of more than 2,000 images of vehicles with license plates. These models were combined with a custom image preprocessing algorithm for character segmentation and tested on a general-purpose personal computer using a new collection of 200 photos of vehicles with license plates. On this testing set, the plate detection system achieved a detection accuracy of 99.5% at an IoU threshold of 0.45, while the OCR engine correctly recognized all characters on 150 license plates, misrecognized one character on 24 plates, and misrecognized two or more characters on 26 plates. The detection step took an average of 80 milliseconds, while the character segmentation and recognition stages took an average of 95 milliseconds, giving an average processing time of 175 milliseconds per image, or roughly 6 images per second. The obtained results are suitable for real-time traffic applications.
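The recognition side of such a pipeline, i.e. preprocessing a detected plate crop and passing it to Tesseract, can be sketched in Python as below; the detector is assumed to return a bounding box, and the preprocessing steps and Tesseract options shown are illustrative assumptions rather than the paper's exact segmentation algorithm.

```python
import cv2
import pytesseract

def read_plate(frame_bgr, box):
    """OCR a detected plate region; `box` is an (x1, y1, x2, y2) rectangle from the detector."""
    x1, y1, x2, y2 = box
    plate = frame_bgr[y1:y2, x1:x2]
    # Illustrative preprocessing before character recognition.
    gray = cv2.cvtColor(plate, cv2.COLOR_BGR2GRAY)
    gray = cv2.resize(gray, None, fx=2.0, fy=2.0, interpolation=cv2.INTER_CUBIC)
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    # --oem 1 selects the LSTM engine that the tessdata-fast models are built for; --psm 7
    # treats the crop as a single text line. The whitelist requires Tesseract >= 4.1.
    config = ("--oem 1 --psm 7 "
              "-c tessedit_char_whitelist=ABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789")
    return pytesseract.image_to_string(binary, config=config).strip()

# Hypothetical usage: text = read_plate(image, (120, 340, 310, 395))
```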


Author(s):  
Pedro H. Barcha Correia ◽  
Gerberth Adín Ramírez Rivera

This project compares state-of-the-art Free Software Optical Character Recognition (OCR) programs. In particular, their results on pages of old books were evaluated. Moreover, in order to optimize recognition for this kind of input, methods not implemented in the programs were proposed and their results were analyzed as well.
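Comparisons of OCR engines on scanned book pages are usually scored by character error rate against a hand-corrected transcription; the helper below is a minimal Python sketch of that metric using a plain Levenshtein distance, with the file names in the usage example being assumptions.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (insert, delete, substitute, each with cost 1)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,                    # deletion
                            curr[j - 1] + 1,                # insertion
                            prev[j - 1] + (ca != cb)))      # substitution
        prev = curr
    return prev[-1]

def character_error_rate(ocr_output: str, ground_truth: str) -> float:
    return levenshtein(ocr_output, ground_truth) / max(len(ground_truth), 1)

# Hypothetical usage: score one engine's output for a page against its ground truth.
# cer = character_error_rate(open("page1.ocr.txt").read(), open("page1.gt.txt").read())
```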


2018 ◽  
Vol 29 (1) ◽  
pp. 688-702 ◽  
Author(s):  
Suman Kumar Bera ◽  
Radib Kar ◽  
Souvik Saha ◽  
Akash Chakrabarty ◽  
Sagnik Lahiri ◽  
...  

Handwritten words can never complement printed words because the former are mostly written in either skewed or slanted form, or both. This very nature of handwriting adds a huge overhead when converting word images into a machine-editable format through an optical character recognition system. Therefore, slope and slant correction are considered fundamental pre-processing tasks in handwritten word recognition. To solve this, researchers have typically followed a two-pass approach in which the slope of the word is corrected first and slant correction is carried out subsequently, making the system computationally expensive. To address this issue, we propose a novel one-pass method, based on fitting an oblique ellipse over the word image, to estimate both the slope and slant angles simultaneously. Furthermore, we have developed three databases of word images, with ground-truth information, for three popular scripts used in India, namely Bangla, Devanagari, and Roman. The experimental results reveal the effectiveness of the proposed method over some state-of-the-art methods for the aforementioned problem.
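The geometric idea can be illustrated with the orientation of the inertia ellipse of a word's ink pixels, computed from second-order image moments; the Python sketch below corrects only the slope (rotation) angle and is a simplification of the paper's combined one-pass slope-and-slant estimation.

```python
import cv2
import numpy as np

def deskew_word(word_bgr):
    """Correct a word's slope using the orientation of the ink pixels' inertia ellipse (simplified)."""
    gray = cv2.cvtColor(word_bgr, cv2.COLOR_BGR2GRAY)
    _, ink = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)
    m = cv2.moments(ink, binaryImage=True)
    if m["m00"] == 0:
        return word_bgr                        # blank image, nothing to correct
    # Orientation of the major axis of the inertia ellipse (degrees, image coordinates).
    slope_deg = 0.5 * np.degrees(np.arctan2(2 * m["mu11"], m["mu20"] - m["mu02"]))
    h, w = gray.shape
    rot = cv2.getRotationMatrix2D((w / 2, h / 2), slope_deg, 1.0)
    return cv2.warpAffine(word_bgr, rot, (w, h), borderValue=(255, 255, 255))
```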


1997 ◽  
Vol 9 (1-3) ◽  
pp. 58-77
Author(s):  
Vitaly Kliatskine ◽  
Eugene Shchepin ◽  
Gunnar Thorvaldsen ◽  
Konstantin Zingerman ◽  
Valery Lazarev

In principle, printed source material should be made machine-readable with systems for Optical Character Recognition, rather than being typed once more. Off-the-shelf commercial OCR programs tend, however, to be inadequate for lists with a complex layout. The tax assessment lists that assess most nineteenth-century farms in Norway constitute one example among a series of valuable sources which can only be interpreted successfully with specially designed OCR software. This paper considers the problems involved in the recognition of material with a complex table structure, outlining a new algorithmic model based on ‘linked hierarchies’. Within the scope of this model, a variety of tables and layouts can be described and recognized. The ‘linked hierarchies’ model has been implemented in the ‘CRIPT’ OCR software system, which successfully reads tables with a complex structure from several different historical sources.
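The abstract names the ‘linked hierarchies’ model without describing it; the sketch below is a purely hypothetical illustration of how a hierarchy of layout regions with cross-links between cells in the same row or column might be represented, and is not the actual CRIPT data model.

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class Region:
    """A hypothetical layout node: part of a hierarchy, cross-linked to related cells."""
    name: str
    bbox: tuple                                   # (x, y, width, height) on the scanned page
    children: List["Region"] = field(default_factory=list)
    row_link: Optional["Region"] = None           # next cell in the same table row
    column_link: Optional["Region"] = None        # cell directly below in the same column

# A tax-list page could then be described as nested, cross-linked regions (all values invented):
farm_name = Region("farm_name", (100, 400, 600, 60))
assessment = Region("assessed_value", (750, 400, 300, 60))
farm_name.row_link = assessment                   # the two cells belong to the same table row
table = Region("table", (100, 350, 1900, 2400), children=[farm_name, assessment])
page = Region("page", (0, 0, 2100, 2970), children=[table])
```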

