scholarly journals Survey on Text Detection and Text Recognition Methodology for Natural Images

Author(s):  
Hongchao Gao ◽  
Yujia Li ◽  
Xi Wang ◽  
Jizhong Han ◽  
Ruixuan Li

2020 ◽  
Vol 2020 ◽  
pp. 1-15
Author(s):  
Fan Zhang ◽  
Jiaxing Luan ◽  
Zhichao Xu ◽  
Wei Chen

Deep learning-based object detection method has been applied in various fields, such as ITS (intelligent transportation systems) and ADS (autonomous driving systems). Meanwhile, text detection and recognition in different scenes have also attracted much attention and research effort. In this article, we propose a new object-text detection and recognition method termed “DetReco” to detect objects and texts and recognize the text contents. The proposed method is composed of object-text detection network and text recognition network. YOLOv3 is used as the algorithm for the object-text detection task and CRNN is employed to deal with the text recognition task. We combine the datasets of general objects and texts together to train the networks. At test time, the detection network detects various objects in an image. Then, the text images are passed to the text recognition network to derive the text contents. The experiments show that the proposed method achieves 78.3 mAP (mean Average Precision) for general objects and 72.8 AP (Average Precision) for texts in regard to detection performance. Furthermore, the proposed method is able to detect and recognize affine transformed or occluded texts with robustness. In addition, for the texts detected around general objects, the text contents can be used as the identifier to distinguish the object.


2016 ◽  
Vol 2016 (17) ◽  
pp. 1-8 ◽  
Author(s):  
Narges Honarvar Nazari ◽  
Tianxiang Tan ◽  
Yao-Yi Chiang

2021 ◽  
Vol 2137 (1) ◽  
pp. 012022
Author(s):  
Da Lu ◽  
Jia Liu ◽  
Helong Li

Abstract Recognizing irregular text in real industrial scenes is a challenging task due to the background clutter, low resolutions or distortions. In this work, an attention-based text detection and recognition method for terminals of current transformer’s secondary circuit is proposed. It consists of three major components: pre-processing, text detection and text recognition. In text recognition module, a novel spatial temporal embedding is designed to better utilize the positional information. During training, the proposed framework only requires sequence-level annotations, instead of extra fine-grained character-level boxes or segmentation masks as in previous work. Despite its simplicity, the proposed method achieves good performance on the dataset collected in actual working scene.


Natural scene text is broadly observed in our everyday life and has countless imperative multimedia applications. Natural scene text typically show signs of outsized discrepancy in font and languages but endures from low resolution, occlusions and intricate background. An android based application Smart Eye which works in offline mode is proposed here for text detection which robustly perceives the text in natural images in real time and translates the text present in image to speech which can assist people with vision disability. The spoken is also converted to text which can aid people with hearing disability.


Sign in / Sign up

Export Citation Format

Share Document