Hybrid pen-input character recognition system based on integration of online-offline recognition

Author(s):  
H. Tanaka ◽  
K. Nakajima ◽  
K. Ishigaki ◽  
K. Akiyama ◽  
M. Nakagawa
Author(s):  
Yi-Hong Tseng ◽  
Chi-Chang Kuo ◽  
Hsi-Jian Lee

In this paper, we propose a methodology for identifying typefaces of printed Chinese characters in documents. Three kinds of features, stroke width means, stroke width variations, and aspect ratio, are first used to classify character typefaces as: Black, Li, Kai-Round, or Ming-Song. Each of the last two groups contains two typefaces. Vertical/horizontal stroke width ratios are used to distinguish between the Ming and Song typefaces and accumulative pixel ratio to distinguish between the Kai and Round typefaces. Six different typeface feature distributions measured from 5401 printed Chinese characters are considered, and a trapezoid-shaped membership function is constructed for each distribution. Based on these membership functions, we determine what typeface each input character belongs to using a two-level decision tree. To increase the identification rate, the typeface of a certain character is adjusted according to the typeface identification results of the front and the next characters. In the character recognition system, we use two statistical features: crossing counts and contour directional counts. We achieved an 89.87% typeface identification rate in our experiments, and a 95.60% character recognition rate.


Author(s):  
Manish M. Kayasth ◽  
Bharat C. Patel

The entire character recognition system is logically characterized into different sections like Scanning, Pre-processing, Classification, Processing, and Post-processing. In the targeted system, the scanned image is first passed through pre-processing modules then feature extraction, classification in order to achieve a high recognition rate. This paper describes mainly on Feature extraction and Classification technique. These are the methodologies which play an important role to identify offline handwritten characters specifically in Gujarati language. Feature extraction provides methods with the help of which characters can identify uniquely and with high degree of accuracy. Feature extraction helps to find the shape contained in the pattern. Several techniques are available for feature extraction and classification, however the selection of an appropriate technique based on its input decides the degree of accuracy of recognition. 


2018 ◽  
Author(s):  
I Wayan Agus Surya Darma

Balinese character recognition is a technique to recognize feature or pattern of Balinese character. Feature of Balinese character is generated through feature extraction process. This research using handwritten Balinese character. Feature extraction is a process to obtain the feature of character. In this research, feature extraction process generated semantic and direction feature of handwritten Balinese character. Recognition is using K-Nearest Neighbor algorithm to recognize 81 handwritten Balinese character. The feature of Balinese character images tester are compared with reference features. Result of the recognition system with K=3 and reference=10 is achieved a success rate of 97,53%.


2020 ◽  
Vol 17 (3) ◽  
pp. 299-305 ◽  
Author(s):  
Riaz Ahmad ◽  
Saeeda Naz ◽  
Muhammad Afzal ◽  
Sheikh Rashid ◽  
Marcus Liwicki ◽  
...  

This paper presents a deep learning benchmark on a complex dataset known as KFUPM Handwritten Arabic TexT (KHATT). The KHATT data-set consists of complex patterns of handwritten Arabic text-lines. This paper contributes mainly in three aspects i.e., (1) pre-processing, (2) deep learning based approach, and (3) data-augmentation. The pre-processing step includes pruning of white extra spaces plus de-skewing the skewed text-lines. We deploy a deep learning approach based on Multi-Dimensional Long Short-Term Memory (MDLSTM) networks and Connectionist Temporal Classification (CTC). The MDLSTM has the advantage of scanning the Arabic text-lines in all directions (horizontal and vertical) to cover dots, diacritics, strokes and fine inflammation. The data-augmentation with a deep learning approach proves to achieve better and promising improvement in results by gaining 80.02% Character Recognition (CR) over 75.08% as baseline.


Sign in / Sign up

Export Citation Format

Share Document