A word recognition algorithm for machine-printed word images of multiple fonts and varying qualities

Author(s):  
S.X. Zhao ◽  
S.N. Srihari
Author(s):  
George Saon ◽  
Abdel Belaïd

In this paper we present a system for the recognition of handwritten words on literal check amounts which advantageously combines HMMs and Markov random fields (MRFs). It operates at the pixel level, in a holistic manner, on height-normalized word images, which are viewed as random field realizations. The HMM analyzes the image along the horizontal writing direction, using a specific state observation probability given by the column product of causal MRF-like pixel conditional probabilities. Aspects concerning the definition, training and recognition via this type of model are developed throughout the paper. We report a 90.08% average word recognition rate on 2378 words and a 79.52% amount rate on 579 amounts of the SRTP* French postal check database (7031 words, 1779 amounts, different writers).
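The "column product" of pixel conditional probabilities can be illustrated with a minimal sketch. It is not the authors' implementation: the three-pixel causal neighbourhood (upper, left, upper-left), the binary pixel values, the zero padding above the top row, and the names `column_log_prob` and `cond_prob` are all assumptions made for illustration. The product is accumulated in the log domain to avoid underflow.

```python
import math

def column_log_prob(column, prev_column, cond_prob):
    """Log of the product of per-pixel conditional probabilities for one column.

    column, prev_column: lists of binary pixels (0 or 1), top to bottom.
    cond_prob: hypothetical table for ONE HMM state, mapping a causal
        neighbourhood (up, left, up_left) to P(pixel = 1 | neighbourhood).
    """
    total = 0.0
    for row, pixel in enumerate(column):
        # Causal neighbours: already-visited pixels only (assumed layout).
        up = column[row - 1] if row > 0 else 0
        left = prev_column[row]
        up_left = prev_column[row - 1] if row > 0 else 0
        p_one = cond_prob[(up, left, up_left)]
        total += math.log(p_one if pixel == 1 else 1.0 - p_one)
    return total

# Toy table: every neighbourhood gives P(pixel = 1) = 0.5.
uniform = {(u, l, ul): 0.5 for u in (0, 1) for l in (0, 1) for ul in (0, 1)}
score = column_log_prob([1, 0, 1, 1], [0, 0, 1, 1], uniform)
```

In an HMM of this kind, one such score per state and per column would play the role of the observation log-probability in the forward or Viterbi recursion.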


2013 ◽  
Vol 5 (2) ◽  
pp. 101-104
Author(s):  
Tomyslav Sledevič ◽  
Liudas Stašionis

The paper describes an FPGA-based implementation of a Lithuanian isolated word recognition algorithm. An FPGA was selected because it allows parallel processes to be implemented in VHDL, which ensures fast signal processing at a low clock rate (up to 50 MHz). Cepstrum analysis was applied to extract speech features. The dynamic time warping (DTW) algorithm was used to compare the vectors of cepstrum coefficients. A library of features for 100 words was created and stored in the internal FPGA BRAM. Experimental testing demonstrated a recognition rate of 94% for speaker-dependent recordings and 58% for speaker-independent recordings. Calculating the cepstrum coefficients for one word took 8.52 ms at a 50 MHz clock, while 100 DTW comparisons took 66.56 ms at a 25 MHz clock. Article in Lithuanian.
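The DTW comparison against a stored word library can be sketched in software as follows. This is a plain Python illustration of the general technique, not the VHDL design from the paper: the Euclidean local distance, the unconstrained step pattern, and the names `dtw_distance` and `recognize` are assumptions.

```python
import math

def dtw_distance(a, b):
    """Accumulated DTW cost between two sequences of feature vectors."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j]: best accumulated cost aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])  # Euclidean distance (assumed)
            cost[i][j] = local + min(cost[i - 1][j],      # insertion
                                     cost[i][j - 1],      # deletion
                                     cost[i - 1][j - 1])  # match
    return cost[n][m]

def recognize(features, library):
    """Return the library word whose template has the smallest DTW cost."""
    return min(library, key=lambda word: dtw_distance(features, library[word]))

# Hypothetical one-dimensional "cepstral" templates for two words.
library = {"vienas": [[0.0], [1.0], [2.0]], "du": [[5.0], [5.0]]}
best = recognize([[0.0], [1.1], [1.0], [2.0]], library)
```

Each comparison fills an n-by-m cost table, so matching one utterance against 100 templates means 100 independent table fills, which is the kind of workload that parallel hardware can spread across several units.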


1980 ◽  
Vol 3 (2) ◽  
pp. 60-70 ◽  
Author(s):  
Jean R. Harber

This article reviews available research findings on the influence of illustrations on the reading performance (i.e., word recognition and reading comprehension) of beginning readers in general and of specific subgroups of beginning readers (e.g., poor achievers, low-ability students). Findings suggest that the presence of illustrations interferes with poorly achieving and low-ability children's performance on word recognition tasks and that illustrations are of questionable value to such children's performance on reading comprehension tasks. The possibility that illustrations serve to distract the poor reader's attention from the printed word is discussed. The inability to filter out extraneous stimuli and focus selectively on a task, which is frequently seen in learning disabled children, is presented in terms of selective attention theory. Suggestions are offered for further research on the effect of illustrations on learning disabled youngsters' reading performance.


2021 ◽  
Vol 3 (8) ◽  
Author(s):  
Fetulhak Abdurahman ◽  
Eyob Sisay ◽  
Kinde Anlay Fante

Amharic is the official language of the Federal Government of Ethiopia, with more than 27 million speakers. It uses an Ethiopic script, which has 238 core and 27 labialized characters. It is a low-resource language, and only a few attempts have been made so far at recognizing its handwritten text. Amharic handwritten text recognition is challenging because of the very high similarity between characters. This paper presents an offline handwritten Amharic word recognition system based on convolutional recurrent neural networks. The proposed framework comprises convolutional neural networks (CNNs) for feature extraction from input word images, recurrent neural networks (RNNs) for sequence encoding, and connectionist temporal classification (CTC) as a loss function. We designed a custom CNN model and compared its performance with three state-of-the-art CNN models, DenseNet-121, ResNet-50 and VGG-19, after modifying their architectures to fit our problem domain, for robust feature extraction from handwritten Amharic word images. We conducted detailed experiments with different CNN and RNN architectures and input word image sizes, and applied data augmentation techniques to enhance the performance of the proposed models. We have prepared a handwritten Amharic word dataset, HARD-I, which is publicly available to researchers. In experiments on various recognition models using our dataset, our best-performing model achieved a WER of 5.24 % and a CER of 1.15 %. The proposed models achieve competitive performance compared to existing models for offline handwritten Amharic word recognition.
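Two pieces of this pipeline are easy to illustrate in isolation: turning a frame-level CTC label path into a label sequence, and scoring predictions with CER and WER. The sketch below is not taken from the paper. It assumes greedy (best-path) decoding with blank index 0, CER defined as total character edit distance over total reference length, and WER for isolated words defined as the fraction of words not recognized exactly. All function names are hypothetical.

```python
def ctc_collapse(path, blank=0):
    """Greedy CTC decoding step: merge repeated labels, then drop blanks."""
    out, prev = [], None
    for label in path:
        if label != prev and label != blank:
            out.append(label)
        prev = label
    return out

def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (row-by-row DP)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(prev[j] + 1,              # deletion
                            curr[j - 1] + 1,          # insertion
                            prev[j - 1] + (r != h)))  # substitution or match
        prev = curr
    return prev[-1]

def cer(refs, hyps):
    """Character error rate over a list of (reference, hypothesis) words."""
    errors = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    return errors / sum(len(r) for r in refs)

def wer_isolated(refs, hyps):
    """Word error rate for isolated words: share of words with any error."""
    return sum(r != h for r, h in zip(refs, hyps)) / len(refs)

# Placeholder strings standing in for reference and predicted words.
refs = ["abc", "de"]
hyps = ["abc", "dx"]
rates = (cer(refs, hyps), wer_isolated(refs, hyps))
```

Because a single wrong character makes the whole word count as an error, WER is normally higher than CER on the same predictions, which is consistent with the 5.24 % and 1.15 % figures reported above.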

