APPLICATION OF ZONAL AND CURVATURE FEATURES TO NUMERALS RECOGNITION

Binod Kumar Prasad

doi:10.18510/ijsrtm.2021.922

APPLICATION OF ZONAL AND CURVATURE FEATURES TO NUMERALS RECOGNITION

International Journal of Students Research in Technology & Management ◽

10.18510/ijsrtm.2021.922 ◽

2021 ◽

Vol 9 (2) ◽

pp. 7-12

Author(s):

Binod Kumar Prasad

Keyword(s):

Language Processing ◽

Character Recognition ◽

Optical Character Recognition ◽

Recognition Rate ◽

Recognition System ◽

Signature Verification ◽

Optical Character ◽

Knn Classifier ◽

Average Recognition Rate ◽

Distance Coding

Purpose of the study: The purpose of this work is to present an offline Optical Character Recognition system to recognise handwritten English numerals to help automation of document reading. It helps to avoid tedious and time-consuming manual typing to key in important information in a computer system to preserve it for a longer time. Methodology: This work applies Curvature Features of English numeral images by encoding them in terms of distance and slope. The finer local details of images have been extracted by using Zonal features. The feature vectors obtained from the combination of these features have been fed to the KNN classifier. The whole work has been executed using the MatLab Image Processing toolbox. Main Findings: The system produces an average recognition rate of 96.67% with K=1 whereas, with K=3, the rate increased to 97% with corresponding errors of 3.33% and 3% respectively. Out of all the ten numerals, some numerals like ‘3’ and ‘8’ have shown respectively lower recognition rates. It is because of the similarity between their structures. Applications of this study: The proposed work is related to the recognition of English numerals. The model can be used widely for recognition of any pattern like signature verification, face recognition, character or word recognition in another language under Natural Language Processing, etc. Novelty/Originality of this study: The novelty of the work lies in the process of feature extraction. Curves present in the structure of a numeral sample have been encoded based on distance and slope thereby presenting Distance features and Slope features. Vertical Delta Distance Coding (VDDC) and Horizontal Delta Distance Coding (HDDC) encode a curve from vertical and horizontal directions to reveal concavity and convexity from different angles.

Download Full-text

Deep Extreme Learning Machine-Based Optical Character Recognition System for Nastalique Urdu-Like Script Languages

The Computer Journal ◽

10.1093/comjnl/bxaa042 ◽

2020 ◽

Author(s):

Syed Saqib Raza Rizvi ◽

Muhammad Adnan Khan ◽

Sagheer Abbas ◽

Muhammad Asadullah ◽

Nida Anwer ◽

...

Keyword(s):

Extreme Learning Machine ◽

Character Recognition ◽

Optical Character Recognition ◽

Recognition Rate ◽

Recognition System ◽

Software Systems ◽

Script Language ◽

Optical Character ◽

Handwritten Text ◽

Learning Machine

Abstract Optical character recognition systems convert printed or handwritten scripts into digital text formats like ASCII or UNICODE. Urdu-like script languages like Urdu, Punjabi and Sindhi are widely spoken languages of the world, especially in Asia. An enormous amount of printed and handwritten text of such languages exist, which needs to be converted into computer-understandable formats for knowledge extraction. In this study, extreme learning machine’s (ELM’s) most recently proposed variant called deep extreme learning machine (DELM)-based optical character recognition (OCR) system is proposed to enhance Urdu-like script language’s character recognition rate. The proposed DELM-based character recognition model is optimizing the OCR process by reducing the overhead of Pre-processing, Segmentation and Feature Extraction Layer. The proposed system evaluations accomplished 98.75% training accuracy with 1.492 × 10−3 RMSE and 98.12% testing accuracy with 1.587 × 10−3 RMSE, with six DELM hidden layers. The results show that the proposed system has attained the foremost recognition rate as compared to any previously proposed Urdu-like script language OCR system. This technique is applicable for machine-printed text and fractionally useful for handwritten text as well. This study will aid in the advancement of more accurate Urdu-like script OCR’s software systems in the future.

Download Full-text

A Design of a Hybrid Algorithm for Optical Character Recognition of Online Hand-Written Arabic Alphabets

Iraqi Journal of Science ◽

10.24996/ijs.2019.60.9.22 ◽

2019 ◽

pp. 2067-2079

Author(s):

Waleed Noori Hussein ◽

Haider N. Hussain

Keyword(s):

Decision Tree ◽

Character Recognition ◽

Optical Character Recognition ◽

Recognition Rate ◽

Recognition System ◽

Optical Character ◽

Recognition Systems ◽

The Difference ◽

Artificial Neural Network Ann ◽

Handwritten Recognition

The growing relevance of printed and digitalized hand-written characters has necessitated the need for convalescent automatic recognition of characters in Optical Character Recognition (OCR). Among the handwritten characters, Arabic is one of those with special attention due to its distinctive nature, and the inherent challenges in its recognition systems. This distinctiveness of Arabic characters, with the difference in personal writing styles and proficiency, are complicating the effectiveness of its online handwritten recognition systems. This research, based on limitations and scope of previous related studies, studied the recognition of Arabic isolated characters through the identification of its features and dots in view of producing an efficient online Arabic handwriting isolated character recognition system. It proposes a hybrid of decision tree and Artificial Neural Network (ANN), as against being combined with other algorithms as found in previous studies. The proposed recognition process has four main steps with associated sub-steps. The results showed that the proposed method achieved the highest performance at 96.7%, whereas the benchmark methods which are EDMS and Naeimizaghiani had 68.88% and 78.5 % respectively. Based on this, ANN has the best performance recognition rate at 98.8%, while the best rate for decision tree was obtained at 97.2%.

Download Full-text

Recognition of handwritten Arabic (Indian) numerals using skeleton matching

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v19.i3.pp1461-1468 ◽

2020 ◽

Vol 19 (3) ◽

pp. 1461

Author(s):

Bassam Alqaralleh ◽

Malek Zakarya Alksasbeh ◽

Tamer Abukhalil ◽

Harbi Almahafzah ◽

Tawfiq Al Rawashdeh

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Recognition Rate ◽

Numerical Data ◽

Recognition System ◽

Worst Case ◽

Limited Success ◽

Optical Character ◽

The Right ◽

Handwritten Arabic

This paper brings into discussion the problem of recognizing Arabic numbers using a monocular camera as the only sensor. When a digital image is presented in this application, optical character recognition (OCR) can be exploited to comprehend numerical data. However, there has been a limited success when applied to the handwritten Arabic (Indian) numbers. This paper aims to overcome this limitation and introduces optical character recognition system based on skeleton matching. The proposed approach is used for handwritten Arabic numbers only. The experimental results indicate the effectiveness of the proposed optical character recognition system even for numbers written in worst case. The right system achieves a recognition rate of 99.3 %.

Download Full-text

10 mW CMOS retina and classifier for handheld, 1000 images/s optical character recognition system

1999 IEEE International Solid-State Circuits Conference. Digest of Technical Papers. ISSCC. First Edition (Cat. No.99CH36278) ◽

10.1109/isscc.1999.759194 ◽

2003 ◽

Author(s):

P. Masa ◽

P. Heim ◽

E. Franzi ◽

X. Arreguit ◽

F. Heitger ◽

...

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Recognition System ◽

Optical Character

Download Full-text

Real‐time optical character recognition on field programmable gate array for automatic number plate recognition system

IET Circuits Devices & Systems ◽

10.1049/iet-cds.2012.0339 ◽

2013 ◽

Vol 7 (6) ◽

pp. 337-344 ◽

Cited By ~ 20

Author(s):

Xiaojun Zhai ◽

Faycal Bensaali ◽

Reza Sotudeh

Keyword(s):

Real Time ◽

Field Programmable Gate Array ◽

Character Recognition ◽

Optical Character Recognition ◽

Recognition System ◽

Optical Character ◽

Field Programmable ◽

Gate Array

Download Full-text

Optical character recognition system based on a novel fuzzy descriptive features

Proceedings 7th International Conference on Signal Processing, 2004. Proceedings. ICSP '04. 2004. ◽

10.1109/icosp.2004.1441471 ◽

2005 ◽

Author(s):

Y. Alginahi ◽

I. El-Feghi ◽

M. Ahmadi ◽

M.A. Sid-Ahmed

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Recognition System ◽

Optical Character

Download Full-text

Developing Automated Optical Character Recognition System Using Machine Learning Algorithm to Solve Payment Verification Issues

10.1109/icoris52787.2021.9649514 ◽

2021 ◽

Author(s):

Michael Siek ◽

Rafi Soeharto

Keyword(s):

Machine Learning ◽

Character Recognition ◽

Optical Character Recognition ◽

Learning Algorithm ◽

Recognition System ◽

Machine Learning Algorithm ◽

Optical Character

Download Full-text

A Finding Aid for The Equity

Inquiry@Queen's Undergraduate Research Conference Proceedings ◽

10.24908/iqurcp.11597 ◽

2018 ◽

Author(s):

Jeff Blackadar

Keyword(s):

Language Processing ◽

Character Recognition ◽

Optical Character Recognition ◽

Large Collection ◽

Digital History ◽

R Language ◽

Potential Value ◽

Optical Character ◽

Text Searching ◽

Person Location

Bibliothèque et Archives Nationales du Québec digitally scanned and converted to text a large collection of newspapers to create a resource of tremendous potential value to historians. Unfortunately, the text files are difficult to search reliably due to many errors caused by the optical character recognition (OCR) text conversion process. This digital history project applied natural language processing in an R language computer program to create a new and useful index of this corpus of digitized content despite OCR related errors. The project used editions of The Equity, published in Shawville, Quebec since 1883. The program extracted the names of all the person, location and organization entities that appeared in each edition. Each of the entities was cataloged in a database and related to the edition of the newspaper it appeared in. The database was published to a public website to allow other researchers to use it. The resulting index or finding aid allows researchers to access The Equity in a different way than just full text searching. People, locations and organizations appearing in the Equity are listed on the website and each entity links to a page that lists all of the issues that entity appeared in as well as the other entities that may be related to it. Rendering the text files of each scanned newspaper into entities and indexing them in a database allows the content of the newspaper to be interacted with by entity name and type rather than just a set of large text files. Website: http://www.jeffblackadar.ca/graham_fellowship/corpus_entities_equity/

Download Full-text

A Prototype for Brazilian Bankcheck Recognition

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001497000238 ◽

1997 ◽

Vol 11 (04) ◽

pp. 549-569 ◽

Cited By ~ 16

Author(s):

Luan L. Lee ◽

Miguel G. Lizarraga ◽

Natanael R. Gomes ◽

Alessandro L. Koerich

Keyword(s):

Information Extraction ◽

Character Recognition ◽

Type I Error ◽

Recognition Rate ◽

Simulated Data ◽

Recognition System ◽

Signature Verification ◽

Type I ◽

Verification Algorithm ◽

Commercial Applications

This paper describes a prototype for Brazilian bankcheck recognition. The description is divided into three topics: bankcheck information extraction, digit amount recognition and signature verification. In bankcheck information extraction, our algorithms provide signature and digit amount images free of background patterns and bankcheck printed information. In digit amount recognition, we dealt with the digit amount segmentation and implementation of a complete numeral character recognition system involving image processing, feature extraction and neural classification. In signature verification, we designed and implemented a static signature verification system suitable for banking and commercial applications. Our signature verification algorithm is capable of detecting both simple, random and skilled forgeries. The proposed automatic bankcheck recognition prototype was intensively tested by real bankcheck data as well as simulated data providing the following performance results: for skilled forgeries, 4.7% equal error rate; for random forgeries, zero Type I error and 7.3% Type II error; for bankcheck numerals, 92.7% correct recognition rate.

Download Full-text

Optical Character Recognition System for Urdu Words in Nastaliq Font

International Journal of Advanced Computer Science and Applications ◽

10.14569/ijacsa.2016.070575 ◽

2016 ◽

Vol 7 (5) ◽

Cited By ~ 5

Author(s):

Safia Shabbir ◽

Imran Siddiqi

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Recognition System ◽

Optical Character

Download Full-text