Efficient Character Skew Rectification in Scene Text Images

Author(s):  
Michal Bušta ◽  
Tomáš Drtina ◽  
David Helekal ◽  
Lukáš Neumann ◽  
Jiří Matas
Keyword(s):  
Sensors ◽  
2021 ◽  
Vol 21 (5) ◽  
pp. 1919
Author(s):  
Shuhua Liu ◽  
Huixin Xu ◽  
Qi Li ◽  
Fei Zhang ◽  
Kun Hou

To address the problem of robot object recognition in complex scenes, this paper proposes an object recognition method based on scene text reading. The proposed method simulates human-like behavior, identifying objects by carefully reading the text on them. First, deep learning models with high accuracy are adopted to detect and recognize text from multiple views. Second, a dataset of 102,000 Chinese and English scene text images, together with its inverse, is generated. Training the model on these two datasets improves the F-measure of text detection by 0.4% and the recognition accuracy by 1.26%. Finally, a robot object recognition method based on scene text reading is proposed. The robot detects and recognizes text in the image and stores the recognition results in a text file. When the user gives the robot a fetching instruction, the robot searches the text files for the corresponding keywords and obtains confidence scores for the objects in the scene image; the object with the maximum confidence is selected as the target. The results show that the robot can accurately distinguish objects of arbitrary shape and category and can effectively solve the problem of object recognition in home environments.
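As an illustration of the fetching step described above, here is a minimal Python sketch of keyword matching over stored recognition results; the names (`ocr_results`, `select_target`) and the similarity measure are assumptions for illustration, not the paper's implementation.

```python
# Sketch of the fetching step: match a user keyword against the text
# recognized on each object and pick the highest-confidence object.
# All names here are illustrative, not from the paper.
from difflib import SequenceMatcher

def keyword_confidence(keyword, recognized):
    # Similarity in [0, 1] between the keyword and one recognized string.
    return SequenceMatcher(None, keyword.lower(), recognized.lower()).ratio()

def select_target(keyword, ocr_results):
    # ocr_results maps object id -> list of text strings read from it.
    best_obj, best_conf = None, 0.0
    for obj_id, texts in ocr_results.items():
        conf = max((keyword_confidence(keyword, t) for t in texts), default=0.0)
        if conf > best_conf:
            best_obj, best_conf = obj_id, conf
    return best_obj, best_conf

# Example: the user asks the robot to fetch the "cola".
results = {"bottle_1": ["Green Tea"], "can_2": ["Cola"], "box_3": ["Crackers"]}
print(select_target("cola", results))  # -> ('can_2', 1.0)
```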


2018 ◽  
Vol 22 (4) ◽  
pp. 1361-1375 ◽  
Author(s):  
Ranjit Ghoshal ◽  
Anandarup Roy ◽  
Ayan Banerjee ◽  
Bibhas Chandra Dhara ◽  
Swapan K. Parui
Keyword(s):  

2020 ◽  
Vol 63 (2) ◽  
Author(s):  
Minghui Liao ◽  
Boyu Song ◽  
Shangbang Long ◽  
Minghang He ◽  
Cong Yao ◽  
...  

2021 ◽  
Author(s):  
Khalil Boukthir ◽  
Abdulrahman M. Qahtani ◽  
Omar Almutiry ◽  
Habib Dhahri ◽  
Adel Alimi

- A novel approach based on Deep Active Learning is presented to reduce annotation effort for Arabic text detection in natural scene images (a minimal sketch of such a loop follows this list).
- A new Arabic scene text image dataset (7k images), named TSVD, collected using the Google Street View service.
- A new semi-automatic method for generating natural scene text images from the streets.
- Training samples are reduced to 1/5 of the original training size on average.
- Much less training data is needed to achieve a better Dice index: 0.84.
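Below is a minimal, illustrative sketch of a deep active learning acquisition loop of the kind the highlights describe; the interfaces (`detector.uncertainty`, `detector.train`, the `oracle` callback) are assumptions for illustration, not the paper's actual API.

```python
# Illustrative deep active learning loop: repeatedly annotate only the
# images the current detector is least certain about. All interfaces
# (detector.uncertainty, detector.train, oracle) are assumed, not from
# the paper.
def active_learning_loop(detector, unlabeled, budget_per_round, rounds, oracle):
    labeled = []
    for _ in range(rounds):
        # Rank unlabeled images by model uncertainty, e.g. mean entropy
        # of the pixel-wise text/non-text predictions.
        ranked = sorted(unlabeled, key=detector.uncertainty, reverse=True)
        picked = ranked[:budget_per_round]
        # The human oracle annotates only the selected images.
        labeled += [(img, oracle(img)) for img in picked]
        unlabeled = ranked[budget_per_round:]
        detector.train(labeled)  # retrain on the enlarged labeled set
    return detector, labeled
```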


Author(s):  
Neelotpal Chakraborty ◽  
Soumyadeep Kundu ◽  
Sayantan Paul ◽  
Ayatullah Faruk Mollah ◽  
Subhadip Basu ◽  
...  

2021 ◽  
Vol 421 ◽  
pp. 222-233
Author(s):  
Mengkai Ma ◽  
Qiu-Feng Wang ◽  
Shan Huang ◽  
Shen Huang ◽  
Yannis Goulermas ◽  
...  

Author(s):  
Shancheng Fang ◽  
Hongtao Xie ◽  
Jianjun Chen ◽  
Jianlong Tan ◽  
Yongdong Zhang

In this work, we propose an entirely learning-based method to automatically synthesize text sequences in natural images using conditional adversarial networks. Since vanilla GANs struggle to capture structural text patterns, directly employing GANs for text image synthesis typically yields illegible images. We therefore design a two-stage architecture to generate repeated characters in images. First, a character generator synthesizes the local appearance of each character independently, so that a legible character sequence is obtained. To achieve style consistency across characters, we propose a novel style loss based on variance minimization. Second, we design a pixel-manipulation word generator, constrained by self-regularization, which learns to convert the local characters into a plausible word image. Experiments on the SVHN, ICDAR, and IIIT5K datasets demonstrate that our method synthesizes visually appealing text images. We also show that the high-quality images synthesized by our method can boost the performance of a scene text recognition algorithm.
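The variance-minimization style loss can be illustrated with a short sketch. The following assumes each character's style is summarized by the channel-wise mean of its feature map and penalizes the variance of these style vectors across the characters of a word; the paper's exact feature choice may differ.

```python
# Sketch of a variance-minimization style loss (assumed formulation):
# summarize each character's style as the channel-wise mean of its
# feature map, then penalize the variance of these style vectors
# across the characters of one word.
import torch

def style_variance_loss(char_features):
    # char_features: (N, C, H, W), one feature map per character.
    style = char_features.mean(dim=(2, 3))          # (N, C) style vectors
    return style.var(dim=0, unbiased=False).mean()  # variance across chars

feats = torch.randn(5, 64, 16, 16)                  # 5 characters in a word
print(style_variance_loss(feats))  # scalar; 0 when all styles are identical
```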

