Extended zone based handwritten Malayalam character recognition using structural features

Author(s):  
P. V. Raveena ◽  
Ajay James ◽  
C. Saravanan
Author(s):  
Saurabh Ravindra Nikam

Abstract: Segmentation is one of the most important processes in a character recognition system, because it largely decides the success of recognition. Segmentation decomposes an image of a sequence of characters into sub-images of individual symbols by segmenting lines and words; that is, the image is partitioned into multiple parts. Segmenting handwritten words into characters is a critical task because of the complexity of structural features and the variations in writing styles. Without segmenting touching characters it is difficult to recognize the individual characters, hence the need to segment touching characters in a word. Here we consider Marathi words and Marathi numbers for segmentation. The algorithm is used for segmentation of lines and then of characters, and the segmented characters are stored in a result variable. It first separates the lines and then separates the characters from the input image. This procedure is repeated until the end of the input is reached. Keywords: Image Segmentation, Handwritten Marathi Characters, Marathi Numbers, OCR.
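
The line-then-character procedure described in this abstract can be illustrated with a projection-profile sketch. This is not the authors' algorithm, which the abstract does not specify; it assumes a binarized image given as a list of rows of 0/1 values (1 = ink), and the names `find_runs` and `segment_lines_and_characters` are hypothetical. It splits only where a fully blank row or column exists, so touching characters, and Devanagari words joined by a headline stroke, would come back as a single segment.

```python
def find_runs(profile):
    """Return (start, end) pairs for each run of consecutive non-zero entries."""
    runs, start = [], None
    for i, value in enumerate(profile):
        if value and start is None:
            start = i                      # a run of ink begins
        elif not value and start is not None:
            runs.append((start, i))        # the run ended at a blank entry
            start = None
    if start is not None:
        runs.append((start, len(profile)))  # run reaches the end of the profile
    return runs


def segment_lines_and_characters(image):
    """Split a binary image (list of rows, 1 = ink) into lines, then characters.

    Returns a list of lines; each line is a list of character sub-images.
    Assumption: lines and characters are separated by fully blank rows/columns.
    """
    result = []
    row_profile = [sum(row) for row in image]           # ink count per row
    for top, bottom in find_runs(row_profile):
        line = image[top:bottom]
        col_profile = [sum(col) for col in zip(*line)]  # ink count per column
        characters = [
            [row[left:right] for row in line]
            for left, right in find_runs(col_profile)
        ]
        result.append(characters)
    return result
```

Calling `segment_lines_and_characters(image)` on a page image returns the nested list that plays the role of the "result variable" mentioned above; separating touching characters would need an additional step that this sketch does not attempt.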


2020 ◽  
pp. 1-12
Author(s):  
Ao Qi ◽  
Liu Narengerile

At present, recognition methods based on character segmentation are not effective at recognizing English text, and the traditional methods rely on the structural features and statistical characteristics of strokes. To improve the recognition of English text, this study takes a machine learning perspective and introduces multiple features to compensate for the lack of information caused by the small Chinese data set. Moreover, this study decomposes the character recognition problem into a text matching problem between question and answer and a textual entailment problem between answer and standard answer, and continues training on a short-text scoring data set. The final result shows some improvement, which supports the usability of the mechanism designed in this paper. To study its performance, the proposed model is compared with a neural network recognition model in terms of recognition accuracy and recognition speed. The results show that the proposed algorithm is effective to some degree.


Optical character recognition (OCR) is a technique for recognizing characters from optically scanned and digitized pages. OCR plays an important role in Indian script research. The official language of the state of Odisha is Odia. OCR faces considerable difficulty in recognizing Odia because of similarly shaped characters, their complex nature, the complicated way in which they combine to form compound characters, the use of matras, and so on. Each character and numeral is passed through several modules, such as binarization, noise removal, segmentation, line segmentation, word segmentation, skeletonization, deskewing, thinning, and thickening. The input image is normalized to a 50 x 50 2D image. The Hidden Markov Model (HMM) is a stochastic model that has been used in various applications, for example speech recognition, handwriting recognition, and gesture recognition. In this paper we use HMMs to recognize Odia characters and numerals. Hidden Markov Models have several advantages: they are resistant to noise, they handle variations in handwriting, and HMM tools are readily available. In our proposed method we have developed an efficient recognition algorithm using a Hidden Markov Model based on moment-based and structural features to recognize Odia characters and numerals.
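
As a rough illustration of how HMM-based recognition of this kind can work, the sketch below scores an observation sequence under one discrete HMM per class with the scaled forward algorithm and picks the class with the highest log-likelihood. The abstract does not describe the model topology or how the moment-based and structural features are turned into observations, so everything here is an assumption: features are taken to be already quantized into integer symbols, and the names `forward_log_likelihood` and `classify` are hypothetical.

```python
import math


def forward_log_likelihood(observations, start_prob, trans_prob, emit_prob):
    """Log P(observations | model) for a discrete HMM, via the scaled forward pass.

    start_prob[s]     : probability of starting in state s
    trans_prob[p][s]  : probability of moving from state p to state s
    emit_prob[s][o]   : probability of emitting symbol o in state s
    Assumption: observations are integer symbol indices (quantized features).
    """
    if not observations:
        return 0.0  # an empty sequence has probability 1
    n_states = len(start_prob)
    alpha = [start_prob[s] * emit_prob[s][observations[0]] for s in range(n_states)]
    log_likelihood = 0.0
    for t in range(len(observations)):
        if t > 0:
            # Propagate through the transitions, then weight by the emission.
            alpha = [
                sum(alpha[p] * trans_prob[p][s] for p in range(n_states))
                * emit_prob[s][observations[t]]
                for s in range(n_states)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence is impossible under this model
        log_likelihood += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
    return log_likelihood


def classify(observations, models):
    """Return the label whose HMM gives the observations the highest likelihood.

    models maps label -> (start_prob, trans_prob, emit_prob).
    """
    return max(
        models,
        key=lambda label: forward_log_likelihood(observations, *models[label]),
    )
```

In a recognizer of this shape, one model would be trained per character or numeral class, and `classify` would be called on the symbol sequence extracted from each normalized character image; training the models (for example with Baum-Welch) is outside this sketch.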

