A word recognition algorithm for machine-printed word images of multiple fonts and varying qualities

In this paper we present a system for the recognition of handwritten words on literal check amounts that combines HMMs and Markov random fields (MRFs). It operates at the pixel level, in a holistic manner, on height-normalized word images, which are viewed as random field realizations. The HMM analyzes the image along the horizontal writing direction, with a state observation probability given by the column product of causal MRF-like pixel conditional probabilities. Aspects concerning the definition, training, and recognition of this type of model are developed throughout the paper. We report a 90.08% average word recognition rate on 2378 words and a 79.52% amount recognition rate on 579 amounts of the SRTP French postal check database (7031 words, 1779 amounts, different writers).
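The column-product observation model described in this abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the authors' design: binary pixels, a two-pixel causal neighbourhood (the pixel to the left and the pixel above), one lookup table of ink probabilities per state, and probabilities strictly between 0 and 1. The names `column_log_prob` and `word_log_likelihood` are hypothetical.

```python
import math


def column_log_prob(image, x, ink_prob):
    """Log-probability of column x under one state.

    The column probability is the product over rows of
    P(pixel | left neighbour, upper neighbour); we sum logs instead.
    ink_prob maps (left, up) -> P(pixel == 1). The neighbourhood is an
    assumption of this sketch; missing neighbours count as background (0).
    """
    total = 0.0
    for y in range(len(image)):
        left = image[y][x - 1] if x > 0 else 0
        up = image[y - 1][x] if y > 0 else 0
        p_ink = ink_prob[(left, up)]
        total += math.log(p_ink if image[y][x] == 1 else 1.0 - p_ink)
    return total


def logsumexp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    m = max(values)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))


def word_log_likelihood(image, states, log_init, log_trans):
    """HMM forward algorithm run over image columns, left to right.

    states    : list of per-state ink_prob tables
    log_init  : log initial-state probabilities
    log_trans : log_trans[r][s] = log P(next state s | current state r)
    """
    n = len(states)
    alpha = [log_init[s] + column_log_prob(image, 0, states[s]) for s in range(n)]
    for x in range(1, len(image[0])):
        alpha = [
            logsumexp([alpha[r] + log_trans[r][s] for r in range(n)])
            + column_log_prob(image, x, states[s])
            for s in range(n)
        ]
    return logsumexp(alpha)
```

In a holistic recognizer of this kind, one such model would be scored per vocabulary word and the word with the highest log-likelihood chosen. Training of the tables and transition probabilities is outside the scope of this sketch.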

The role of tonal information during spoken-word recognition in Chinese: Evidence from a printed-word eye-tracking study

Memory & Cognition ◽ 10.3758/s13421-020-01070-0 ◽ 2020

Author(s): Wei Shen ◽ Jukka Hyönä ◽ Youxi Wang ◽ Meiling Hou ◽ Jing Zhao

Keyword(s): Word Recognition ◽ Eye Tracking ◽ Spoken Word Recognition ◽ Spoken Word ◽ Printed Word

FPGA-BASED IMPLEMENTATION OF LITHUANIAN ISOLATED WORD RECOGNITION ALGORITHM / LIETUVIŲ KALBOS PAVIENIŲ ŽODŽIŲ ATPAŽINIMO ALGORITMO ĮGYVENDINIMAS LAUKU PROGRAMUOJAMA LOGINE MATRICA

Mokslas - Lietuvos ateitis ◽ 10.3846/mla.2013.18 ◽ 2013 ◽ Vol 5 (2) ◽ pp. 101-104

Author(s): Tomyslav Sledevič ◽ Liudas Stašionis

Keyword(s): Word Recognition ◽ Experimental Testing ◽ Recognition Rate ◽ Recognition Algorithm ◽ Clock Signal ◽ Time Warping ◽ Process Implementation ◽ Isolated Word ◽ Dynamic Time ◽ Isolated Word Recognition

The paper describes an FPGA-based implementation of a Lithuanian isolated word recognition algorithm. An FPGA was selected so that parallel processes could be implemented in VHDL, which ensures fast signal processing at a low clock rate (up to 50 MHz). Cepstrum analysis was applied to extract features from the voice signal. The dynamic time warping (DTW) algorithm was used to compare the vectors of cepstrum coefficients. A library of features for 100 words was created and stored in the internal FPGA BRAM. Experimental testing demonstrated a recognition rate of 94% for speaker-dependent records and 58% for speaker-independent records. Calculating the cepstrum coefficients for one word took 8.52 ms at a 50 MHz clock, while 100 DTWs took 66.56 ms at a 25 MHz clock. Article in Lithuanian.
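The matching step described in this abstract, comparing a sequence of cepstrum vectors against a stored library with dynamic time warping, can be sketched in software as follows. This is a plain sequential illustration, not the paper's parallel VHDL design. The Euclidean local distance, the unconstrained warping path, and the names `dtw_distance` and `recognize` are assumptions of this sketch.

```python
import math


def euclidean(a, b):
    """Local distance between two feature vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_distance(seq_a, seq_b):
    """Classic DTW: minimum cumulative distance over all monotonic alignments."""
    n, m = len(seq_a), len(seq_b)
    # cost[i][j] = best cumulative distance aligning seq_a[:i] with seq_b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = euclidean(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j],      # stay on the same frame of seq_b
                cost[i][j - 1],      # stay on the same frame of seq_a
                cost[i - 1][j - 1],  # advance both sequences
            )
    return cost[n][m]


def recognize(features, library):
    """Return the library word whose template is closest under DTW."""
    return min(library, key=lambda word: dtw_distance(features, library[word]))


# Hypothetical two-word library with one-dimensional "cepstrum" frames.
library = {"word_a": [[0.0], [1.0], [2.0]], "word_b": [[5.0], [5.0]]}
best = recognize([[0.0], [0.0], [1.0], [2.0]], library)
```

With a 100-word library, `recognize` performs 100 independent DTW computations per utterance, which is the part the abstract reports timing for.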

A new connected word recognition algorithm based on HMM/LVQ segmentation and LVQ classification

[Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing ◽ 10.1109/icassp.1991.150291 ◽ 1991 ◽ Cited By ~ 10

Author(s): P. Ramesh ◽ S. Katagiri ◽ C.-H. Lee

Keyword(s): Word Recognition ◽ Recognition Algorithm

Effects of Illustrations on Reading Performance: Implications for Further LD Research

Learning Disability Quarterly ◽ 10.2307/1510508 ◽ 1980 ◽ Vol 3 (2) ◽ pp. 60-70 ◽ Cited By ~ 14

Author(s): Jean R. Harber

Keyword(s): Reading Comprehension ◽ Word Recognition ◽ Learning Disabled ◽ Reading Performance ◽ Beginning Readers ◽ Disabled Children ◽ The Poor ◽ Research Findings ◽ Printed Word ◽ Low Ability

This article reviews available research findings on the influence of illustrations on the reading performance (i.e., word recognition and reading comprehension) of beginning readers in general and in specific subgroups of beginning readers (e.g., poor achievers, low-ability students). Findings suggest that the presence of illustrations interferes with poorly achieving and low-ability children's performance on word recognition tasks and that illustrations are of questionable value to such children's performance on reading comprehension tasks. The possibility that illustrations serve to distract the poor reader's attention from the printed word is discussed. The inability to filter out extraneous stimuli and focus selectively on a task frequently seen in learning disabled children is presented in terms of selective attention theory. Suggestions are offered for further research on the effect of illustrations on learning disabled youngsters' reading performance.

AHWR-Net: offline handwritten amharic word recognition using convolutional recurrent neural network

SN Applied Sciences ◽ 10.1007/s42452-021-04742-x ◽ 2021 ◽ Vol 3 (8)

Author(s): Fetulhak Abdurahman ◽ Eyob Sisay ◽ Kinde Anlay Fante

Keyword(s): Neural Network ◽ Neural Networks ◽ Feature Extraction ◽ Word Recognition ◽ Recurrent Neural Network ◽ Text Recognition ◽ Input Word ◽ Handwritten Text ◽ Handwritten Text Recognition ◽ Word Images

Amharic is the official language of the Federal Government of Ethiopia, with more than 27 million speakers. It uses an Ethiopic script, which has 238 core and 27 labialized characters. It is a low-resourced language, and few attempts have been made so far at handwritten text recognition for it. Amharic handwritten text recognition is challenging because of the very high similarity between characters. This paper presents an offline handwritten Amharic word recognition system based on convolutional recurrent neural networks. The proposed framework comprises convolutional neural networks (CNNs) for feature extraction from input word images, recurrent neural networks (RNNs) for sequence encoding, and connectionist temporal classification as the loss function. For robust feature extraction from handwritten Amharic word images, we designed a custom CNN model and compared its performance with three state-of-the-art CNN models (DenseNet-121, ResNet-50, and VGG-19) after modifying their architectures to fit our problem domain. We conducted detailed experiments with different CNN and RNN architectures and input word image sizes, and applied data augmentation techniques to enhance the performance of the proposed models. We have prepared a handwritten Amharic word dataset, HARD-I, which is publicly available to researchers. In experiments with various recognition models on our dataset, our best-performing model achieved a WER of 5.24 % and a CER of 1.15 %. The proposed models achieve competitive performance compared to existing models for offline handwritten Amharic word recognition.
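The abstract reports WER and CER without defining them. A common way to compute the two metrics for isolated word images is sketched below. The definitions are assumptions of this sketch: CER is the total character-level edit distance divided by the total reference length, and WER is the fraction of word images whose predicted transcription differs from the reference. The paper's exact definitions may differ, and the function names are hypothetical.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance: minimum insertions, deletions, substitutions."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(
                prev[j] + 1,             # delete r from the reference
                curr[j - 1] + 1,         # insert h into the reference
                prev[j - 1] + (r != h),  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def character_error_rate(references, hypotheses):
    """Total character edits divided by total reference characters."""
    errors = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    return errors / sum(len(r) for r in references)


def word_error_rate(references, hypotheses):
    """Fraction of word images recognized incorrectly (assumed definition)."""
    wrong = sum(r != h for r, h in zip(references, hypotheses))
    return wrong / len(references)


# Illustrative strings only; these are not samples from HARD-I.
refs = ["kitten", "word"]
hyps = ["sitting", "word"]
cer = character_error_rate(refs, hyps)
wer = word_error_rate(refs, hyps)
```

Because Python strings are sequences of Unicode code points, the same functions work on Ethiopic text, with each code point counted as one character.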