Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text

Abdelrahman Abdallah; Mohamed Hamada; Daniyar Nurseitov

doi:10.3390/jimaging6120141

Attention-Based Fully Gated CNN-BGRU for Russian Handwritten Text

Journal of Imaging ◽

10.3390/jimaging6120141 ◽

2020 ◽

Vol 6 (12) ◽

pp. 141

Author(s):

Abdelrahman Abdallah ◽

Mohamed Hamada ◽

Daniyar Nurseitov

Keyword(s):

Error Rate ◽

Handwriting Recognition ◽

Text Recognition ◽

P Value ◽

Word Error Rate ◽

Test Dataset ◽

Handwritten Text ◽

Proposed Model ◽

Handwritten Text Recognition ◽

Gated Recurrent Unit

This article considers the task of handwritten text recognition using attention-based encoder–decoder networks trained in the Kazakh and Russian languages. We have developed a novel deep neural network model based on a fully gated CNN, supported by multiple bidirectional gated recurrent unit (BGRU) and attention mechanisms to manipulate sophisticated features that achieve 0.045 Character Error Rate (CER), 0.192 Word Error Rate (WER), and 0.253 Sequence Error Rate (SER) for the first test dataset and 0.064 CER, 0.24 WER and 0.361 SER for the second test dataset. Our proposed model is the first work to handle handwriting recognition models in Kazakh and Russian languages. Our results confirm the importance of our proposed Attention-Gated-CNN-BGRU approach for training handwriting text recognition and indicate that it can lead to statistically significant improvements (p-value < 0.05) in the sensitivity (recall) over the tests dataset. The proposed method’s performance was evaluated using handwritten text databases of three languages: English, Russian, and Kazakh. It demonstrates better results on the Handwritten Kazakh and Russian (HKR) dataset than the other well-known models.

Download Full-text

Line-segment Feature Analysis Algorithm Using Input Dimensionality Reduction for Handwritten Text Recognition

Applied Sciences ◽

10.3390/app10196904 ◽

2020 ◽

Vol 10 (19) ◽

pp. 6904

Author(s):

Chang-Min Kim ◽

Ellen J. Hong ◽

Kyungyong Chung ◽

Roy C. Park

Keyword(s):

Dimensionality Reduction ◽

Line Segment ◽

Handwriting Recognition ◽

Text Recognition ◽

Feature Analysis ◽

Support Vector ◽

License Plate ◽

Field Methods ◽

Handwritten Text ◽

Handwritten Text Recognition

Recently, demand for handwriting recognition, such as automation of mail sorting, license plate recognition, and electronic memo pads, has exponentially increased in various industrial fields. In addition, in the image recognition field, methods using artificial convolutional neural networks, which show outstanding performance, have been applied to handwriting recognition. However, owing to the diversity of recognition application fields, the number of dimensions in the learning and reasoning processes is increasing. To solve this problem, a principal component analysis (PCA) technique is used for dimensionality reduction. However, PCA is likely to increase the accuracy loss due to data compression. Therefore, in this paper, we propose a line-segment feature analysis (LFA) algorithm for input dimensionality reduction in handwritten text recognition. This proposed algorithm extracts the line segment information, constituting the image of input data, and assigns a unique value to each segment using 3 × 3 and 5 × 5 filters. Using the unique values to identify the number of line segments and adding them up, a 1-D vector with a size of 512 is created. This vector is used as input to machine-learning. For the performance evaluation of the method, the Extending Modified National Institute of Standards and Technology (EMNIST) database was used. In the evaluation, PCA showed 96.6% and 93.86% accuracy with k-nearest neighbors (KNN) and support vector machine (SVM), respectively, while LFA showed 97.5% and 98.9% accuracy with KNN and SVM, respectively.

Download Full-text

Boosting of Deep Convolutional Architectures for Arabic Handwriting Recognition

International Journal of Multimedia Data Engineering and Management ◽

10.4018/ijmdem.2019100102 ◽

2019 ◽

Vol 10 (4) ◽

pp. 26-45 ◽

Cited By ~ 1

Author(s):

Mohamed Elleuch ◽

Monji Kherallah

Keyword(s):

Character Recognition ◽

State Of The Art ◽

Handwriting Recognition ◽

Image Data ◽

Text Recognition ◽

Deep Belief Networks ◽

Handwritten Text ◽

Handwritten Text Recognition ◽

Accuracy Rates ◽

Hierarchical Representations

In recent years, deep learning (DL) based systems have become very popular for constructing hierarchical representations from unlabeled data. Moreover, DL approaches have been shown to exceed foregoing state of the art machine learning models in various areas, by pattern recognition being one of the more important cases. This paper applies Convolutional Deep Belief Networks (CDBN) to textual image data containing Arabic handwritten script (AHS) and evaluated it on two different databases characterized by the low/high-dimension property. In addition to the benefits provided by deep networks, the system is protected against over-fitting. Experimentally, the authors demonstrated that the extracted features are effective for handwritten character recognition and show very good performance comparable to the state of the art on handwritten text recognition. Yet using Dropout, the proposed CDBN architectures achieved a promising accuracy rates of 91.55% and 98.86% when applied to IFN/ENIT and HACDB databases, respectively.

Download Full-text

Handwritten Text Recognition using Machine Learning Techniques in Application of NLP

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.a4748.129219 ◽

2019 ◽

Vol 9 (2) ◽

pp. 1394-1397

Keyword(s):

Pattern Recognition ◽

Handwriting Recognition ◽

Machine Learning Techniques ◽

Text Recognition ◽

Spatial Form ◽

Process Systems ◽

Handwritten Text ◽

Handwritten Text Recognition ◽

Bank Checks ◽

Cursive Script

Handwriting Detection is a technique or ability of a Computer to receive and interpret intelligible handwritten input from source such as paper documents, touch screen, photo graphs etc. Handwritten Text recognition is one of area pattern recognition. The purpose of pattern recognition is to categorizing or classification data or object of one of the classes or categories. Handwriting recognition is defined as the task of transforming a language represented in its spatial form of graphical marks into its symbolic representation. Each script has a set of icons, which are known as characters or letters, which have certain basic shapes. The goal of handwriting is to identify input characters or image correctly then analyzed to many automated process systems. This system will be applied to detect the writings of different format. The development of handwriting is more sophisticated, which is found various kinds of handwritten character such as digit, numeral, cursive script, symbols, and scripts including English and other languages. The automatic recognition of handwritten text can be extremely useful in many applications where it is necessary to process large volumes of handwritten data, such as recognition of addresses and postcodes on envelopes, interpretation of amounts on bank checks, document analysis, and verification of signatures. Therefore, computer is needed to be able to read document or data for ease of document processing.

Download Full-text

Hiragana Handwriting Recognition Using Deep Neural Network Search

International Journal of Interactive Mobile Technologies (iJIM) ◽

10.3991/ijim.v14i01.11593 ◽

2020 ◽

Vol 14 (01) ◽

pp. 161

Author(s):

Rosalina Rosalina ◽

Johanes Parlindungan Hutagalung ◽

Genta Sahuri

Keyword(s):

Neural Network ◽

Social Media ◽

Deep Neural Network ◽

Handwriting Recognition ◽

Text Recognition ◽

Android Application ◽

Network Search ◽

Handwritten Text ◽

Handwritten Text Recognition ◽

Computer Storage

<span id="orcid-id" class="orcid-id-https">These days there is a huge demand in “storing the information available in paper documents into a computer storage disk”. Digitizing manual filled forms lead to handwriting recognition, a process of translating handwriting into machine editable text. The main objective of this research is to to create an Android application able to recognize and predict the output of handwritten characters by training a neural network model. This research will implement deep neural network in recognizing handwritten text recognition especially to recognize digits, Latin / Alphabet and Hiragana, capture an image or choose the image from gallery to scan the handwritten text from the image, use the live camera to detect the handwritten text real – time without capturing an image and could copy the results of the output from the off-line recognition and share it to other platforms such as notes, Email, and social media. </span>

Download Full-text

Feature Extraction Comparison in Handwriting Recognition of Batak Toba Alphabet

IJITEE (International Journal of Information Technology and Electrical Engineering) ◽

10.22146/ijitee.31969 ◽

2018 ◽

Vol 1 (3) ◽

pp. 86

Author(s):

Novie Theresia Br Pasaribu ◽

M. Jimmy Hasugian

Keyword(s):

Feature Extraction ◽

Handwriting Recognition ◽

Noise Removal ◽

Text Recognition ◽

Fourier Descriptor ◽

Research Topics ◽

Discriminative Feature ◽

Handwritten Text ◽

Handwritten Text Recognition ◽

Offline Handwriting Recognition

Offline handwriting recognition is one of the most prominent research topics due to its tremendous application and high variability as well. This paper covers the offline Batak Toba handwritten text recognition, from the noise removal, the process of feature extraction until the recognition by using several classifiers. Experiments show that elliptic fourier descriptor (EFD) is the most discriminative feature and Mahalanobis distance (MD) outperforms the two others classifier.

Download Full-text

Offline Writer Recognition for Kurdish Handwritten Text Document Based on Proposed Codebook

UHD Journal of Science and Technology ◽

10.21928/uhdjst.v5n2y2021.pp21-27 ◽

2021 ◽

Vol 5 (1) ◽

pp. 21

Author(s):

Twana Latif Mohammed ◽

Ahmed Abdullah Ahmed

Keyword(s):

Probability Distribution ◽

Handwriting Recognition ◽

Text Recognition ◽

Writer Identification ◽

Text Document ◽

Handwritten Text ◽

Handwritten Text Recognition ◽

Writer Recognition ◽

Prior Literature ◽

Writing Sample

Handwritten text recognition has been an ongoing attractive task to research in the field of document analysis and recognition with applications in handwriting forensics, paleography, document examination, and handwriting recognition. In the present research, an automatic method of writer recognition is presented using digitized images of unconstrained texts. Despite the increasing efforts by prior literature on the different methods used for the same purpose, such methods performance, particularly their accuracy, has not been promising, leaving plenty of room for improvements. This method made use of codebook-based writer characterization, with each writing sample represented by a group of computed features from a primary and secondary codebook. The writings were then represented through the computation of the probability of codebook patterns occurrence, and the probability distribution was employed for each writer’s characterization. Writer identification process involved comparing two writings through the computation of the distances between their respective probability distribution. The study carried out experiments to determine the performance of the implemented method in light of rates of identification with the help of standard datasets, namely, KRDOH and IAM, the former being the most current and largest Kurdish handwritten datasets with 1076 writers, and the latter being a dataset containing 650 writers. The outcome of the experiments was promising with a rate of identification of 94.3%, with the proposed method outperforming the state-of-the-art methods by 2–3%.

Download Full-text

Effective offline handwritten text recognition model based on a sequence-to-sequence approach with CNN–RNN networks

Neural Computing and Applications ◽

10.1007/s00521-020-05556-5 ◽

2021 ◽

Author(s):

R. Geetha ◽

T. Thilagam ◽

T. Padmavathy

Keyword(s):

Text Recognition ◽

Recognition Model ◽

Model Based ◽

Handwritten Text ◽

Handwritten Text Recognition

Download Full-text

Offline Handwritten Text Recognition Using Deep Learning: A Review

Journal of Physics Conference Series ◽

10.1088/1742-6596/1848/1/012015 ◽

2021 ◽

Vol 1848 (1) ◽

pp. 012015

Author(s):

Yintong Wang ◽

Wenjie Xiao ◽

Shuo Li

Keyword(s):

Deep Learning ◽

Text Recognition ◽

Handwritten Text ◽

Handwritten Text Recognition

Download Full-text

ICFHR2014 Competition on Handwritten Text Recognition on Transcriptorium Datasets (HTRtS)

2014 14th International Conference on Frontiers in Handwriting Recognition ◽

10.1109/icfhr.2014.137 ◽

2014 ◽

Cited By ~ 18

Author(s):

Joan Andreu Sanchez ◽

Veronica Romero ◽

Alejandro H. Toselli ◽

Enrique Vidal

Keyword(s):

Text Recognition ◽

Handwritten Text ◽

Handwritten Text Recognition

Download Full-text

HANDWRITTEN WORD RECOGNITION USING CLASSIFIER ENSEMBLES GENERATED FROM MULTIPLE PROTOTYPES

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001404003496 ◽

2004 ◽

Vol 18 (05) ◽

pp. 957-974 ◽

Cited By ~ 4

Author(s):

SIMON GÜNTER ◽

HORST BUNKE

Keyword(s):

Handwriting Recognition ◽

Recognition Rate ◽

Classifier Ensembles ◽

Multiple Classifier Systems ◽

Classification Problems ◽

Classifier Systems ◽

Handwritten Text ◽

Handwritten Text Recognition ◽

Multiple Classifier ◽

System 1

Handwritten text recognition is one of the most difficult problems in the field of pattern recognition. In this paper, we describe our efforts towards improving the performance of state-of-the-art handwriting recognition systems through the use of classifier ensembles. There are many examples of classification problems in the literature where multiple classifier systems increase the performance over single classifiers. Normally one of the two following approaches is used to create a multiple classifier system. (1) Several classifiers are developed completely independent of each other and combined in a last step. (2) Several classifiers are created out of one prototype classifier by using so-called classifier ensemble creation methods. In this paper an algorithm which combines both approaches is introduced and it is used to increase the recognition rate of a hidden Markov model (HMM) based handwritten word recognizer.

Download Full-text