Hybrid pen-input character recognition system based on integration of online-offline recognition

In this paper, we propose a methodology for identifying typefaces of printed Chinese characters in documents. Three kinds of features, stroke width means, stroke width variations, and aspect ratio, are first used to classify character typefaces as: Black, Li, Kai-Round, or Ming-Song. Each of the last two groups contains two typefaces. Vertical/horizontal stroke width ratios are used to distinguish between the Ming and Song typefaces and accumulative pixel ratio to distinguish between the Kai and Round typefaces. Six different typeface feature distributions measured from 5401 printed Chinese characters are considered, and a trapezoid-shaped membership function is constructed for each distribution. Based on these membership functions, we determine what typeface each input character belongs to using a two-level decision tree. To increase the identification rate, the typeface of a certain character is adjusted according to the typeface identification results of the front and the next characters. In the character recognition system, we use two statistical features: crossing counts and contour directional counts. We achieved an 89.87% typeface identification rate in our experiments, and a 95.60% character recognition rate.

Download Full-text

A Study of Different Methodologies Helpful in the Identification of Offline Handwritten Script

International Journal of Emerging Research in Management and Technology ◽

10.23956/ijermt.v6i6.287 ◽

2018 ◽

Vol 6 (6) ◽

pp. 307

Author(s):

Manish M. Kayasth ◽

Bharat C. Patel

Keyword(s):

Feature Extraction ◽

Character Recognition ◽

Recognition Rate ◽

Recognition System ◽

Post Processing ◽

Classification Technique ◽

Scanned Image ◽

Gujarati Language ◽

High Degree ◽

Selection Of

The entire character recognition system is logically characterized into different sections like Scanning, Pre-processing, Classification, Processing, and Post-processing. In the targeted system, the scanned image is first passed through pre-processing modules then feature extraction, classification in order to achieve a high recognition rate. This paper describes mainly on Feature extraction and Classification technique. These are the methodologies which play an important role to identify offline handwritten characters specifically in Gujarati language. Feature extraction provides methods with the help of which characters can identify uniquely and with high degree of accuracy. Feature extraction helps to find the shape contained in the pattern. Several techniques are available for feature extraction and classification, however the selection of an appropriate technique based on its input decides the degree of accuracy of recognition.

Download Full-text

Offline Ancient Tamil Character Recognition System Based On Structural Features

i-manager s Journal on Communication Engineering and Systems ◽

10.26634/jcs.1.3.1891 ◽

2012 ◽

Vol 1 (3) ◽

pp. 17-24

Author(s):

S. Rajakumar ◽

V. Subbiah Bharathi

Keyword(s):

Character Recognition ◽

Recognition System ◽

Structural Features

Download Full-text

Handwritten Balinesse Character Recognition using K-Nearest Neighbor

10.31227/osf.io/z6m8u ◽

2018 ◽

Author(s):

I Wayan Agus Surya Darma

Keyword(s):

Feature Extraction ◽

Success Rate ◽

Character Recognition ◽

Nearest Neighbor ◽

Recognition System ◽

Extraction Process ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

K Nearest Neighbor Algorithm ◽

Character Feature

Balinese character recognition is a technique to recognize feature or pattern of Balinese character. Feature of Balinese character is generated through feature extraction process. This research using handwritten Balinese character. Feature extraction is a process to obtain the feature of character. In this research, feature extraction process generated semantic and direction feature of handwritten Balinese character. Recognition is using K-Nearest Neighbor algorithm to recognize 81 handwritten Balinese character. The feature of Balinese character images tester are compared with reference features. Result of the recognition system with K=3 and reference=10 is achieved a success rate of 97,53%.

Download Full-text

A Deep Learning based Arabic Script Recognition System: Benchmark on KHAT

The International Arab Journal of Information Technology ◽

10.34028/iajit/17/3/3 ◽

2020 ◽

Vol 17 (3) ◽

pp. 299-305 ◽

Cited By ~ 1

Author(s):

Riaz Ahmad ◽

Saeeda Naz ◽

Muhammad Afzal ◽

Sheikh Rashid ◽

Marcus Liwicki ◽

...

Keyword(s):

Deep Learning ◽

Character Recognition ◽

Data Augmentation ◽

Short Term Memory ◽

Recognition System ◽

Learning Approach ◽

Arabic Text ◽

Data Set ◽

Processing Step ◽

Handwritten Arabic

This paper presents a deep learning benchmark on a complex dataset known as KFUPM Handwritten Arabic TexT (KHATT). The KHATT data-set consists of complex patterns of handwritten Arabic text-lines. This paper contributes mainly in three aspects i.e., (1) pre-processing, (2) deep learning based approach, and (3) data-augmentation. The pre-processing step includes pruning of white extra spaces plus de-skewing the skewed text-lines. We deploy a deep learning approach based on Multi-Dimensional Long Short-Term Memory (MDLSTM) networks and Connectionist Temporal Classification (CTC). The MDLSTM has the advantage of scanning the Arabic text-lines in all directions (horizontal and vertical) to cover dots, diacritics, strokes and fine inflammation. The data-augmentation with a deep learning approach proves to achieve better and promising improvement in results by gaining 80.02% Character Recognition (CR) over 75.08% as baseline.

Download Full-text