Online Handwritten Gujarati Word Recognition

Author(s):  
Vishal A. Naik ◽  
Apurva A. Desai

In this article, an online handwritten word recognition system for the Gujarati language is presented by combining strokes, characters, punctuation marks, and diacritics. The authors have used a support vector machine classification algorithm with a radial basis function kernel. The authors used a hybrid features set. The hybrid feature set consists of directional features with curvature data. The authors have used a normalized chain code and zoning-based chain code features. Words are a combination of characters and diacritics. Recognized strokes require post-processing to form a word. The authors have used location-based and mapping rule-based post-processing methods. The authors have achieved an accuracy of 95.3% for individual characters, 91.5% for individual words, and 83.3% for sentences. The average processing time for individual characters is 0.071 seconds.

Author(s):  
Manish M. Kayasth ◽  
Bharat C. Patel

The entire character recognition system is logically characterized into different sections like Scanning, Pre-processing, Classification, Processing, and Post-processing. In the targeted system, the scanned image is first passed through pre-processing modules then feature extraction, classification in order to achieve a high recognition rate. This paper describes mainly on Feature extraction and Classification technique. These are the methodologies which play an important role to identify offline handwritten characters specifically in Gujarati language. Feature extraction provides methods with the help of which characters can identify uniquely and with high degree of accuracy. Feature extraction helps to find the shape contained in the pattern. Several techniques are available for feature extraction and classification, however the selection of an appropriate technique based on its input decides the degree of accuracy of recognition. 


2020 ◽  
Vol 5 (2) ◽  
pp. 504
Author(s):  
Matthias Omotayo Oladele ◽  
Temilola Morufat Adepoju ◽  
Olaide ` Abiodun Olatoke ◽  
Oluwaseun Adewale Ojo

Yorùbá language is one of the three main languages that is been spoken in Nigeria. It is a tonal language that carries an accent on the vowel alphabets. There are twenty-five (25) alphabets in Yorùbá language with one of the alphabets a digraph (GB). Due to the difficulty in typing handwritten Yorùbá documents, there is a need to develop a handwritten recognition system that can convert the handwritten texts to digital format. This study discusses the offline Yorùbá handwritten word recognition system (OYHWR) that recognizes Yorùbá uppercase alphabets. Handwritten characters and words were obtained from different writers using the paint application and M708 graphics tablets. The characters were used for training and the words were used for testing. Pre-processing was done on the images and the geometric features of the images were extracted using zoning and gradient-based feature extraction. Geometric features are the different line types that form a particular character such as the vertical, horizontal, and diagonal lines. The geometric features used are the number of horizontal lines, number of vertical lines, number of right diagonal lines, number of left diagonal lines, total length of all horizontal lines, total length of all vertical lines, total length of all right slanting lines, total length of all left-slanting lines and the area of the skeleton. The characters are divided into 9 zones and gradient feature extraction was used to extract the horizontal and vertical components and geometric features in each zone. The words were fed into the support vector machine classifier and the performance was evaluated based on recognition accuracy. Support vector machine is a two-class classifier, hence a multiclass SVM classifier least square support vector machine (LSSVM) was used for word recognition. The one vs one strategy and RBF kernel were used and the recognition accuracy obtained from the tested words ranges between 66.7%, 83.3%, 85.7%, 87.5%, and 100%. The low recognition rate for some of the words could be as a result of the similarity in the extracted features.


2020 ◽  
Author(s):  
Thamba Meshach W ◽  
Hemajothi S ◽  
Mary Anita E A

Abstract Human affect recognition (HAR) using images of facial expression and electrocardiogram (ECG) signal plays an important role in predicting human intention. This system improves the performance of the system in applications like the security system, learning technologies and health care systems. The primary goal of our work is to recognize individual affect states automatically using the multilayered binary structured support vector machine (MBSVM), which efficiently classify the input into one of the four affect classes, relax, happy, sad and angry. The classification is performed efficiently by designing an efficient support vector machine (SVM) classifier in multilayer mode operation. The classifier is trained using the 8-fold cross-validation method, which improves the learning of the classifier, thus increasing its efficiency. The classification and recognition accuracy is enhanced and also overcomes the drawback of ‘facial mimicry’ by using hybrid features that are extracted from both facial images (visual elements) and physiological signal ECG (signal features). The reliability of the input database is improved by acquiring the face images and ECG signals experimentally and by inducing emotions through image stimuli. The performance of the affect recognition system is evaluated using the confusion matrix, obtaining the classification accuracy of 96.88%.


Author(s):  
XIAOLONG FAN ◽  
BRIJESH VERMA

This paper presents a comparative analysis of segmentation and non-segmentation based techniques for cursive handwritten word recognition. In our segmentation based technique, every word is segmented into characters, the chain code features are extracted from segmented characters, the features are fed to neural network classifier and finally the words are constructed using a string compare algorithm. In our non-segmentation based technique, the chain code features are extracted directly from words and the words are fed to a neural network classifier to classify them into word classes. To make a fair comparison, a CEDAR benchmark database is used, and the parameters such as the number of words, thresholding, resizing, feature extraction techniques, etc. are kept same for both the techniques. The experimental results and analysis show that the non-segmentation technique achieves higher recognition rate than the segmentation based technique.


Author(s):  
Ke Han ◽  
Ishwar K. Sethi

Off-line cursive script recognition has got increasing attention during the last three decades since it is of interest in several areas such as banking and postal service. An off-line cursive handwritten word recognition system is described in this paper and is used for legal amount interpretation in personal checks. The proposed recognition system uses a set of geometric and topologic features to characterize each word. By considering the spatial distribution of these features in a word image, the proposed system maps each word into two strings of finite symbols. A local associative indexing scheme is then used on these strings to organize a vocabulary. When presented with an unknown word, the system uses the same indexing scheme to retrieve a set of candidate words likely to match the input word. A verification process is then carried out to find the best match among the candidate set. The performance of the proposed system has been tested with a legal amount image database from real bankchecks. The results obtained indicate that the proposed system is able to recognize legal amounts with great accuracy.


Sign in / Sign up

Export Citation Format

Share Document