Scene Text Recognition Based on Bidirectional LSTM and Deep Neural Network

Deep learning is a subfield of artificial intelligence that allows the computer to adopt and learn some new rules. Deep learning algorithms can identify images, objects, observations, texts, and other structures. In recent years, scene text recognition has inspired many researchers from the computer vision community, and still, it needs improvement because of the poor performance of existing scene recognition algorithms. This research paper proposed a novel approach for scene text recognition that integrates bidirectional LSTM and deep convolution neural networks. In the proposed method, first, the contour of the image is identified and then it is fed into the CNN. CNN is used to generate the ordered sequence of the features from the contoured image. The sequence of features is now coded using the Bi-LSTM. Bi-LSTM is a handy tool for extracting the features from the sequence of words. Hence, this paper combines the two powerful mechanisms for extracting the features from the image, and contour-based input image makes the recognition process faster, which makes this technique better compared to existing methods. The results of the proposed methodology are evaluated on MSRATD 50 dataset, SVHN dataset, vehicle number plate dataset, SVT dataset, and random datasets, and the accuracy is 95.22%, 92.25%, 96.69%, 94.58%, and 98.12%, respectively. According to quantitative and qualitative analysis, this approach is more promising in terms of accuracy and precision rate.

Download Full-text

Ethiopic Natural Scene Text Recognition Using Deep Learning Approaches

Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering - Advances of Science and Technology ◽

10.1007/978-3-030-43690-2_36 ◽

2020 ◽

pp. 502-511

Author(s):

Direselign Addis ◽

Chuan-Ming Liu ◽

Van-Dai Ta

Keyword(s):

Deep Learning ◽

Text Recognition ◽

Learning Approaches ◽

Natural Scene ◽

Scene Text ◽

Scene Text Recognition

Download Full-text

A Novel Scene Text Recognition Method Based on Deep Learning

Computers Materials & Continua ◽

10.32604/cmc.2019.05595 ◽

2019 ◽

Vol 60 (2) ◽

pp. 781-794 ◽

Cited By ~ 4

Author(s):

Maosen Wang ◽

Shaozhang Niu ◽

Zhenguang Gao

Keyword(s):

Deep Learning ◽

Text Recognition ◽

Recognition Method ◽

Scene Text ◽

Scene Text Recognition

Download Full-text

A 307-fps 351.7-GOPs/W Deep Learning FPGA Accelerator for Real-Time Scene Text Recognition

2019 International Conference on Field-Programmable Technology (ICFPT) ◽

10.1109/icfpt47387.2019.00043 ◽

2019 ◽

Cited By ~ 1

Author(s):

Shirui Zhao ◽

Fengwei An ◽

Hao Yu

Keyword(s):

Deep Learning ◽

Real Time ◽

Text Recognition ◽

Scene Text ◽

Scene Text Recognition

Download Full-text

Scene Text Recognition: A Preliminary Investigation on Various Techniques and Implementation Using Deep Learning Classifiers

Advances in Intelligent Systems and Computing - International Conference on Innovative Computing and Communications ◽

10.1007/978-981-15-1286-5_20 ◽

2020 ◽

pp. 233-242

Author(s):

N. Bhavesh Shri Kumar ◽

Dasi Naga Brahma Krishna Sumanth Reddy ◽

K. Sairam ◽

J. Naren

Keyword(s):

Deep Learning ◽

Preliminary Investigation ◽

Text Recognition ◽

Learning Classifiers ◽

Scene Text ◽

Scene Text Recognition

Download Full-text

Scene Text Recognition Based on Deep Learning: A Brief Survey

2019 IEEE 11th International Conference on Communication Software and Networks (ICCSN) ◽

10.1109/iccsn.2019.8905316 ◽

2019 ◽

Author(s):

Yuxin Chen ◽

Yunxue Shao

Keyword(s):

Deep Learning ◽

Text Recognition ◽

Scene Text ◽

Scene Text Recognition

Download Full-text

Arabic Scene Text Recognition in the Deep Learning Era: Analysis on A Novel Dataset

IEEE Access ◽

10.1109/access.2021.3100717 ◽

2021 ◽

pp. 1-1

Author(s):

Heba Hassan ◽

Ahmed El-Mahdy ◽

Mohamed E. Hussein.

Keyword(s):

Deep Learning ◽

Text Recognition ◽

Scene Text ◽

Scene Text Recognition

Download Full-text

Decoupled Attention Network for Text Recognition

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6903 ◽

2020 ◽

Vol 34 (07) ◽

pp. 12216-12224 ◽

Cited By ~ 4

Author(s):

Tianwei Wang ◽

Yuanzhi Zhu ◽

Lianwen Jin ◽

Canjie Luo ◽

Xiaoxue Chen ◽

...

Keyword(s):

Input Image ◽

Text Recognition ◽

Visual Features ◽

Attention Network ◽

Considerable Research ◽

Handwritten Text ◽

Handwritten Text Recognition ◽

Scene Text ◽

Alignment Problem ◽

Scene Text Recognition

Text recognition has attracted considerable research interests because of its various applications. The cutting-edge text recognition methods are based on attention mechanisms. However, most of attention methods usually suffer from serious alignment problem due to its recurrency alignment operation, where the alignment relies on historical decoding results. To remedy this issue, we propose a decoupled attention network (DAN), which decouples the alignment operation from using historical decoding results. DAN is an effective, flexible and robust end-to-end text recognizer, which consists of three components: 1) a feature encoder that extracts visual features from the input image; 2) a convolutional alignment module that performs the alignment operation based on visual features from the encoder; and 3) a decoupled text decoder that makes final prediction by jointly using the feature map and attention maps. Experimental results show that DAN achieves state-of-the-art performance on multiple text recognition tasks, including offline handwritten text recognition and regular/irregular scene text recognition. Codes will be released.1

Download Full-text

Scene Text Recognition Based on Deep Learning

Lecture Notes in Electrical Engineering - Communications, Signal Processing, and Systems ◽

10.1007/978-981-13-9409-6_133 ◽

2020 ◽

pp. 1136-1143

Author(s):

Yunxue Shao ◽

Yuxin Chen

Keyword(s):

Deep Learning ◽

Text Recognition ◽

Scene Text ◽

Scene Text Recognition

Download Full-text

Multi-granularity Deep Local Representations for Irregular Scene Text Recognition

ACM/IMS Transactions on Data Science ◽

10.1145/3446971 ◽

2021 ◽

Vol 2 (2) ◽

pp. 1-18

Author(s):

Hongchao Gao ◽

Yujia Li ◽

Jiao Dai ◽

Xi Wang ◽

Jizhong Han ◽

...

Keyword(s):

State Of The Art ◽

Visual Representation ◽

Text Recognition ◽

Natural Scene ◽

Attention Network ◽

Training Time ◽

Scene Text ◽

Benchmark Datasets ◽

Local Representations ◽

Scene Text Recognition

Recognizing irregular text from natural scene images is challenging due to the unconstrained appearance of text, such as curvature, orientation, and distortion. Recent recognition networks regard this task as a text sequence labeling problem and most networks capture the sequence only from a single-granularity visual representation, which to some extent limits the performance of recognition. In this article, we propose a hierarchical attention network to capture multi-granularity deep local representations for recognizing irregular scene text. It consists of several hierarchical attention blocks, and each block contains a Local Visual Representation Module (LVRM) and a Decoder Module (DM). Based on the hierarchical attention network, we propose a scene text recognition network. The extensive experiments show that our proposed network achieves the state-of-the-art performance on several benchmark datasets including IIIT-5K, SVT, CUTE, SVT-Perspective, and ICDAR datasets under shorter training time.

Download Full-text

Arabic Cursive Text Recognition from Natural Scene Images

Applied Sciences ◽

10.3390/app9020236 ◽

2019 ◽

Vol 9 (2) ◽

pp. 236 ◽

Cited By ~ 6

Author(s):

Saad Ahmed ◽

Saeeda Naz ◽

Muhammad Razzak ◽

Rubiyah Yusof

Keyword(s):

Recognition System ◽

Document Image ◽

Text Recognition ◽

Chinese Script ◽

Challenging Problem ◽

Future Directions ◽

Scene Text ◽

Comprehensive Survey ◽

Recognition Systems ◽

Scene Text Recognition

This paper presents a comprehensive survey on Arabic cursive scene text recognition. The recent years’ publications in this field have witnessed the interest shift of document image analysis researchers from recognition of optical characters to recognition of characters appearing in natural images. Scene text recognition is a challenging problem due to the text having variations in font styles, size, alignment, orientation, reflection, illumination change, blurriness and complex background. Among cursive scripts, Arabic scene text recognition is contemplated as a more challenging problem due to joined writing, same character variations, a large number of ligatures, the number of baselines, etc. Surveys on the Latin and Chinese script-based scene text recognition system can be found, but the Arabic like scene text recognition problem is yet to be addressed in detail. In this manuscript, a description is provided to highlight some of the latest techniques presented for text classification. The presented techniques following a deep learning architecture are equally suitable for the development of Arabic cursive scene text recognition systems. The issues pertaining to text localization and feature extraction are also presented. Moreover, this article emphasizes the importance of having benchmark cursive scene text dataset. Based on the discussion, future directions are outlined, some of which may provide insight about cursive scene text to researchers.

Download Full-text