A Bilingual Numeral OCR System for Creating Uni-Lingual Digitized Numeral Document

<p>The optical character recognition has been used in many applications such as dictionary generation, customer billing system, banking and postal automation, and library automation etc. The bilingual OCR system to make uni-lingual script helps us to reduce the requirement of two different OCR systems into a single OCR system for recognition of two different languages. This type of globalization helps the universal users of any language can read the text documents in their self-language if the bilingual documents are converted into uni-lingual document. In this paper, the image which contains printed Tamil and European numerals has been recognized using common OCR System and the Tamil numerals are converted into European numerals to globalize the document from a bilingual script into a uni-lingual document. The main objective of the work is to bring out the single numeral (European numerals) text document from the input image with two different numerals (Tamil and European Numerals). The Kohonen’s self-organizing map (SOM) based recognition system has been used for recognizing the numerals and recognized characters in bilingual numerals (Tamil and European Numerals) form are converted into Uni-lingual form (European numerals). This paper also discusses the various approaches used for OCR.</p>

Download Full-text

Development of the documents comparison module for an electronic document management system

Information Technology and Nanotechnology ◽

10.18287/1613-0073-2019-2416-527-533 ◽

2019 ◽

pp. 527-533

Author(s):

M A Mikheev ◽

P Y Yakimov

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Document Management ◽

Electronic Document ◽

Text Documents ◽

Text Document ◽

Document Management System ◽

Optical Character ◽

Electronic Document Management ◽

Scanned Image

The article is devoted to solving the problem of document versions comparison in electronic document management systems. Systems-analogues were considered, the process of comparing text documents was studied. In order to recognize the text on the scanned image, the technology of optical character recognition and its implementation — Tesseract library were chosen. The Myers algorithm is applied to compare received texts. The software implementation of the text document comparison module was implemented using the solutions described above.

Download Full-text

RECOGNITION OF HANDWRITTEN SIMILAR CHINESE CHARACTERS BY SELF-GROWING PROBABILISTIC DECISION-BASED NEURAL NETWORK

International Journal of Neural Systems ◽

10.1142/s0129065799000575 ◽

1999 ◽

Vol 09 (06) ◽

pp. 545-561 ◽

Cited By ~ 3

Author(s):

HSIN-CHIA FU ◽

Y. Y. XU ◽

H. Y. CHANG

Keyword(s):

Neural Network ◽

Character Recognition ◽

Optical Character Recognition ◽

Recognition Accuracy ◽

Recognition System ◽

Difficult Problem ◽

Input Image ◽

Prototype System ◽

Credit Assignment ◽

Similar Character

Recognition of similar (confusion) characters is a difficult problem in optical character recognition (OCR). In this paper, we introduce a neural network solution that is capable of modeling minor differences among similar characters, and is robust to various personal handwriting styles. The Self-growing Probabilistic Decision-based Neural Network (SPDNN) is a probabilistic type neural network, which adopts a hierarchical network structure with nonlinear basis functions and a competitive credit-assignment scheme. Based on the SPDNN model, we have constructed a three-stage recognition system. First, a coarse classifier determines a character to be input to one of the pre-defined subclasses partitioned from a large character set, such as Chinese mixed with alphanumerics. Then a character recognizer determines the input image which best matches the reference character in the subclass. Lastly, the third module is a similar character recognizer, which can further enhance the recognition accuracy among similar or confusing characters. The prototype system has demonstrated a successful application of SPDNN to similar handwritten Chinese recognition for the public database CCL/HCCR1 (5401 characters × 200 samples). Regarding performance, experiments on the CCL/HCCR1 database produced 90.12% recognition accuracy with no rejection, and 94.11% accuracy with 6.7% rejection, respectively. This recognition accuracy represents about 4% improvement on the previously announced performance.5,11 As to processing speed, processing before recognition (including image preprocessing, segmentation, and feature extraction) requires about one second for an A4 size character image, and recognition consumes approximately 0.27 second per character on a Pentium-100 based personal computer, without use of any hardware accelerator or co-processor.

Download Full-text

10 mW CMOS retina and classifier for handheld, 1000 images/s optical character recognition system

1999 IEEE International Solid-State Circuits Conference. Digest of Technical Papers. ISSCC. First Edition (Cat. No.99CH36278) ◽

10.1109/isscc.1999.759194 ◽

2003 ◽

Author(s):

P. Masa ◽

P. Heim ◽

E. Franzi ◽

X. Arreguit ◽

F. Heitger ◽

...

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Recognition System ◽

Optical Character

Download Full-text

Real‐time optical character recognition on field programmable gate array for automatic number plate recognition system

IET Circuits Devices & Systems ◽

10.1049/iet-cds.2012.0339 ◽

2013 ◽

Vol 7 (6) ◽

pp. 337-344 ◽

Cited By ~ 20

Author(s):

Xiaojun Zhai ◽

Faycal Bensaali ◽

Reza Sotudeh

Keyword(s):

Real Time ◽

Field Programmable Gate Array ◽

Character Recognition ◽

Optical Character Recognition ◽

Recognition System ◽

Optical Character ◽

Field Programmable ◽

Gate Array

Download Full-text

Optical character recognition system based on a novel fuzzy descriptive features

Proceedings 7th International Conference on Signal Processing, 2004. Proceedings. ICSP '04. 2004. ◽

10.1109/icosp.2004.1441471 ◽

2005 ◽

Author(s):

Y. Alginahi ◽

I. El-Feghi ◽

M. Ahmadi ◽

M.A. Sid-Ahmed

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Recognition System ◽

Optical Character

Download Full-text

Developing Automated Optical Character Recognition System Using Machine Learning Algorithm to Solve Payment Verification Issues

10.1109/icoris52787.2021.9649514 ◽

2021 ◽

Author(s):

Michael Siek ◽

Rafi Soeharto

Keyword(s):

Machine Learning ◽

Character Recognition ◽

Optical Character Recognition ◽

Learning Algorithm ◽

Recognition System ◽

Machine Learning Algorithm ◽

Optical Character

Download Full-text

Optical Character Recognition System for Urdu Words in Nastaliq Font

International Journal of Advanced Computer Science and Applications ◽

10.14569/ijacsa.2016.070575 ◽

2016 ◽

Vol 7 (5) ◽

Cited By ~ 5

Author(s):

Safia Shabbir ◽

Imran Siddiqi

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Recognition System ◽

Optical Character

Download Full-text

Handwriting Recognition System Using Optical Character Recognition

International Journal of Scientific Research in Computer Sciences and Engineering ◽

10.26438/ijsrcse/v6i3.1821 ◽

2018 ◽

Vol 6 (3) ◽

pp. 18-21

Author(s):

Priti Gangania ◽

Sowmya Mishra ◽

Shreshtha Garg ◽

Sonam Agarwal

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Handwriting Recognition ◽

Recognition System ◽

Optical Character

Download Full-text

Unconstrained Handwritten Text Line Segmentation for Kannada Language

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.j9624.1081219 ◽

2019 ◽

Vol 8 (12) ◽

pp. 953-956

Keyword(s):

Character Recognition ◽

Recognition System ◽

Text Line ◽

Connected Component ◽

Horizontal Projection ◽

Text Documents ◽

Handwritten Text ◽

Kannada Language ◽

System Separation ◽

Line Segmentation

Segmentation is division of something into smaller parts and one of the Component of character recognition system. Separation of characters, words and lines are done in Segmentation from text documents. character recognition is a process which allows computers to recognize written or printed characters such as numbers or letters and to change them into a form that the computer can use. the accuracy of OCR system is done by taking the output of an OCR run for an image and comparing it to the original version of the same text. The main aim of this paper is to find out the various text line segmentations are Projection profiles, Weighted Bucket Method. Proposed method is horizontal projection profile and connected component method on Handwritten Kannada language. These methods are used for experimentation and finally comparing their accuracy and results.

Download Full-text

A large-scale optical character recognition system simulation

10.1145/800287.811185 ◽

1974 ◽

Cited By ~ 1

Author(s):

David P. Himmel ◽

David Peasner

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Large Scale ◽

System Simulation ◽

Recognition System ◽

Optical Character

Download Full-text