When Massive GPU Parallelism Ain’t Enough: A Novel Hardware Architecture of 2D-LSTM Neural Network

2022 ◽  
Vol 15 (1) ◽  
pp. 1-35
Author(s):  
Vladimir Rybalkin ◽  
Jonas Ney ◽  
Menbere Kina Tekleyohannes ◽  
Norbert Wehn

Multidimensional Long Short-Term Memory (MD-LSTM) neural network is an extension of one-dimensional LSTM for data with more than one dimension. MD-LSTM achieves state-of-the-art results in various applications, including handwritten text recognition, medical imaging, and many more. However, its implementation suffers from the inherently sequential execution that tremendously slows down both training and inference compared to other neural networks. The main goal of the current research is to provide acceleration for inference of MD-LSTM. We advocate that Field-Programmable Gate Array (FPGA) is an alternative platform for deep learning that can offer a solution when the massive parallelism of GPUs does not provide the necessary performance required by the application. In this article, we present the first hardware architecture for MD-LSTM. We conduct a systematic exploration to analyze a tradeoff between precision and accuracy. We use a challenging dataset for semantic segmentation, namely historical document image binarization from the DIBCO 2017 contest and a well-known MNIST dataset for handwritten digit recognition. Based on our new architecture, we implement FPGA-based accelerators that outperform Nvidia Geforce RTX 2080 Ti with respect to throughput by up to 9.9 and Nvidia Jetson AGX Xavier with respect to energy efficiency by up to 48 . Our accelerators achieve higher throughput, energy efficiency, and resource efficiency than FPGA-based implementations of convolutional neural networks (CNNs) for semantic segmentation tasks. For the handwritten digit recognition task, our FPGA implementations provide higher accuracy and can be considered as a solution when accuracy is a priority. Furthermore, they outperform earlier FPGA implementations of one-dimensional LSTMs with respect to throughput, energy efficiency, and resource efficiency.

2020 ◽  
Vol 224 ◽  
pp. 01025
Author(s):  
Alexey Beskopylny ◽  
Alexandr Lyapin ◽  
Nikita Beskopylny ◽  
Elena Kadomtseva

The article is devoted to the problem of comparing the effectiveness of feedforward (FF) and convolutional neural networks (CNN) algorithms in the problems of handwritten digit recognition and classification. In recent years, the attention of many researchers to the FF and CNN algorithms has given rise to many hybrid models focused on solving specific problems. At the same time, the efficiency of each algorithm in terms of accuracy and labour intensity remains unclear. It is shown that in classical problems, FFs can have advantages over CNN in terms of labour intensity with the same accuracy of results. Using the handwritten digits data from the MNIST database as an example, it is shown that FF algorithms provide greater accuracy and require less computation time than CNN.


2017 ◽  
Vol 25 (6) ◽  
pp. 979-990
Author(s):  
David Álvarez-León ◽  
Ramón-Ángel Fernández-Díaz ◽  
Lidia Sánchez-Gonzalez ◽  
José-Manuel Alija-Pérez

Abstract This article presents an Off-line handwritten digit recognition approach based on neural networks. We define a numeric character as a composition of vertical and horizontal strokes. After the preprocessing, we use dynamic zoning to retrieve the positions where vertical strokes – the main strokes — are joined to horizontal strokes. These features are recorded into a representative string and verified using a custom matching pattern. Finally, a multilayer perceptron neural network is fed with the previous data to raise the learning process. The results gathered from the experiments performed on the well-known MNIST handwritten database are compared against other proposals providing promising results.


Author(s):  
Anshul Dubey ◽  
Ashley Lazarus ◽  
Dharmendra Mangal

Handwritten digit recognition, is a technique of identifying and enlisting the recognized digit, that uses neural networks, deep learning and machine learning. The applications and demand of handwritten digit recognition systems such as zip code recognition, car number plate recognition, robotics, banks, mobile applications and numerous more, are soaring every day. It can be done through numerous approaches, but convolutional neural network is considered one of the best methods. The special neural network uses multilayer architecture for identification and classification. Although the accuracy factor can be increased, based on image preprocessing, in this paper we discuss how the accuracy of the system can be increased for better handwritten digit recognition, using convolutional neural networks, image preprocessing; binarization, resizing, rotation. The accuracy rate obtained is 99.33%.


2020 ◽  
Vol 17 (4) ◽  
pp. 572-578
Author(s):  
Mohammad Parseh ◽  
Mohammad Rahmanimanesh ◽  
Parviz Keshavarzi

Persian handwritten digit recognition is one of the important topics of image processing which significantly considered by researchers due to its many applications. The most important challenges in Persian handwritten digit recognition is the existence of various patterns in Persian digit writing that makes the feature extraction step to be more complicated.Since the handcraft feature extraction methods are complicated processes and their performance level are not stable, most of the recent studies have concentrated on proposing a suitable method for automatic feature extraction. In this paper, an automatic method based on machine learning is proposed for high-level feature extraction from Persian digit images by using Convolutional Neural Network (CNN). After that, a non-linear multi-class Support Vector Machine (SVM) classifier is used for data classification instead of fully connected layer in final layer of CNN. The proposed method has been applied to HODA dataset and obtained 99.56% of recognition rate. Experimental results are comparable with previous state-of-the-art methods


Sign in / Sign up

Export Citation Format

Share Document