Multilingual Scene Text Detection Using Gradient Morphology

Author(s):  
Dibyajyoti Dhar ◽  
Neelotpal Chakraborty ◽  
Sayan Choudhury ◽  
Ashis Paul ◽  
Ayatullah Faruk Mollah ◽  
...  

Text detection in natural scene images is an interesting problem in the field of information retrieval. Several methods have been proposed over the past few decades for scene text detection. However, the robustness and efficiency of these methods are degraded by their high sensitivity to various image complexities. Moreover, in a multilingual environment where text may occur in multiple languages, a method may not be suitable for detecting scene text in certain languages. To counter these challenges, a gradient morphology-based method is proposed in this paper that proves robust against image complexities and efficiently detects scene text irrespective of language. The method is validated using low-quality images from standard multilingual datasets such as MSRA-TD500 and MLe2e. The performance of the method is compared with that of some state-of-the-art methods, and comparatively better results are observed.
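For illustration, a minimal OpenCV sketch of a generic gradient-morphology pipeline for extracting candidate text regions is given below; the kernel sizes, thresholds, and geometric filters are assumptions for demonstration, not the authors' exact configuration.

```python
# Illustrative sketch only: a generic gradient-morphology pipeline for
# candidate text region extraction with OpenCV. It is NOT the paper's
# exact method; kernel sizes and thresholds are assumptions.
import cv2

def detect_text_candidates(image_path):
    img = cv2.imread(image_path)
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

    # Morphological gradient highlights strong local intensity transitions,
    # which text strokes typically produce regardless of script or language.
    kernel = cv2.getStructuringElement(cv2.MORPH_ELLIPSE, (3, 3))
    gradient = cv2.morphologyEx(gray, cv2.MORPH_GRADIENT, kernel)

    # Binarize the gradient map (Otsu) and close gaps between characters
    # so that nearby strokes merge into word/line-level components.
    _, binary = cv2.threshold(gradient, 0, 255, cv2.THRESH_BINARY | cv2.THRESH_OTSU)
    close_kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (9, 1))
    closed = cv2.morphologyEx(binary, cv2.MORPH_CLOSE, close_kernel)

    # Keep connected components whose geometry plausibly matches text.
    boxes = []
    contours, _ = cv2.findContours(closed, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    for c in contours:
        x, y, w, h = cv2.boundingRect(c)
        if w > 8 and h > 8 and 0.1 < h / float(w) < 10.0:
            boxes.append((x, y, w, h))
    return boxes
```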

Author(s):  
Enze Xie ◽  
Yuhang Zang ◽  
Shuai Shao ◽  
Gang Yu ◽  
Cong Yao ◽  
...  

Scene text detection methods based on deep learning have achieved remarkable results over the past years. However, due to the high diversity and complexity of natural scenes, previous state-of-the-art text detection methods may still produce a considerable number of false positives when applied to images captured in real-world environments. To tackle this issue, mainly inspired by Mask R-CNN, we propose in this paper an effective model for scene text detection based on Feature Pyramid Network (FPN) and instance segmentation. We propose a supervised pyramid context network (SPCNet) to precisely locate text regions while suppressing false positives. Benefiting from the guidance of semantic information and a shared FPN, SPCNet obtains significantly enhanced performance while introducing marginal extra computation. Experiments on standard datasets demonstrate that our SPCNet clearly outperforms state-of-the-art methods. Specifically, it achieves an F-measure of 92.1% on ICDAR2013, 87.2% on ICDAR2015, 74.1% on ICDAR2017 MLT and 82.9% on Total-Text.
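As a rough illustration of the Mask R-CNN / FPN base that SPCNet builds on, the following torchvision sketch configures an instance-segmentation model for a single text class; the paper's supervised pyramid context components (e.g. its text context guidance and re-scoring) are not reproduced here.

```python
# Illustrative sketch: a Mask R-CNN detector with an FPN backbone adapted to
# one "text" class using torchvision. This only shows the generic base that
# SPCNet-style methods build on, not the paper's full model.
import torch
import torchvision
from torchvision.models.detection.faster_rcnn import FastRCNNPredictor
from torchvision.models.detection.mask_rcnn import MaskRCNNPredictor

def build_text_maskrcnn(num_classes=2):  # background + text
    model = torchvision.models.detection.maskrcnn_resnet50_fpn(weights="DEFAULT")

    # Replace the box classification head for the two-class problem.
    in_features = model.roi_heads.box_predictor.cls_score.in_features
    model.roi_heads.box_predictor = FastRCNNPredictor(in_features, num_classes)

    # Replace the mask head so it predicts per-instance text masks.
    in_features_mask = model.roi_heads.mask_predictor.conv5_mask.in_channels
    model.roi_heads.mask_predictor = MaskRCNNPredictor(in_features_mask, 256, num_classes)
    return model

model = build_text_maskrcnn()
model.eval()
with torch.no_grad():
    # Returns a list of dicts with "boxes", "masks", "labels", "scores".
    preds = model([torch.rand(3, 800, 800)])
```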


2017 ◽  
Vol 260 ◽  
pp. 112-122 ◽  
Author(s):  
Chunna Tian ◽  
Yong Xia ◽  
Xiangnan Zhang ◽  
Xinbo Gao

2020 ◽  
Vol 2020 ◽  
pp. 1-11
Author(s):  
Weijia Wu ◽  
Jici Xing ◽  
Cheng Yang ◽  
Yuxing Wang ◽  
Hong Zhou

The performance of text detection is crucial for the subsequent recognition task. Currently, the accuracy of text detectors still needs further improvement, particularly for texts with irregular shapes in complex environments. We propose a pixel-wise method based on instance segmentation for scene text detection. Specifically, a text instance is split into five components: a Text Skeleton and four Directional Pixel Regions; the instance is then restored from these elements, and a component that fails can receive supplementary information from the other regions. In addition, a Confidence Scoring Mechanism is designed to filter out regions that merely resemble text instances. Experiments on several challenging benchmarks demonstrate that our method achieves state-of-the-art results in scene text detection, with an F-measure of 84.6% on Total-Text and 86.3% on CTW1500.
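A simplified sketch of a skeleton-plus-directional-regions decomposition is shown below using scikit-image and SciPy; grouping pixels by the direction of their nearest skeleton point is an assumption made here for illustration and only approximates the paper's Directional Pixel Regions.

```python
# Illustrative sketch: decompose a binary text-instance mask into a skeleton
# plus four coarse directional regions (pixels grouped by the direction of
# their nearest skeleton point). Only an approximation of the paper's idea.
import numpy as np
from skimage.morphology import skeletonize
from scipy.ndimage import distance_transform_edt

def decompose_text_instance(mask):
    """mask: 2-D boolean array, True on text pixels."""
    skeleton = skeletonize(mask)

    # For every pixel, find the offset to its nearest skeleton pixel.
    # distance_transform_edt on the complement of the skeleton also returns,
    # per pixel, the indices of that nearest skeleton pixel.
    _, (near_r, near_c) = distance_transform_edt(~skeleton, return_indices=True)
    rows, cols = np.indices(mask.shape)
    dr, dc = rows - near_r, cols - near_c

    regions = np.zeros(mask.shape, dtype=np.uint8)  # 0 = background/skeleton
    body = mask & ~skeleton
    vertical = np.abs(dr) >= np.abs(dc)
    regions[body & vertical & (dr < 0)] = 1    # above the skeleton
    regions[body & vertical & (dr >= 0)] = 2   # below the skeleton
    regions[body & ~vertical & (dc < 0)] = 3   # left of the skeleton
    regions[body & ~vertical & (dc >= 0)] = 4  # right of the skeleton
    return skeleton, regions
```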


Author(s):  
Tong Li ◽  
Wanggen Li ◽  
Nannan Zhu ◽  
Xuecheng Gong ◽  
Jiajia Chen

Author(s):  
Rajae Moumen ◽  
Raddouane Chiheb ◽  
Rdouan Faizi

The aim of this research is to propose a fully convolutional approach to the problem of real-time scene text detection for the Arabic language. Text detection is performed using a two-step multi-scale approach. The first step uses a lightweight fully convolutional network, the TextBlockDetector FCN, an adaptation of VGG-16 that eliminates non-textual elements, localizes wide-scale text, and estimates text scale. The second step determines the narrow scale range of the text using a fully convolutional network for maximum performance. To evaluate the system, we compare the results of the framework with those obtained by a single VGG-16 fully deployed for text detection in one shot, as well as with previous state-of-the-art results. For training and testing, we build a dataset of 575 manually processed images and apply data augmentation to enrich the training process. The system scores a precision of 0.651 vs. 0.64 for the state of the art, and 24.3 FPS vs. 31.7 FPS for a fully deployed VGG-16.
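A minimal PyTorch sketch of a VGG-16-based fully convolutional text-block scorer in the spirit of the first stage is given below; the head layers, channel widths, and upsampling choice are assumptions, not the TextBlockDetector FCN architecture itself.

```python
# Illustrative sketch: a small VGG-16-based fully convolutional head that
# outputs a per-pixel text/non-text score map. Layer choices here are
# assumptions, not the paper's exact architecture.
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision.models import vgg16

class TextBlockFCN(nn.Module):
    def __init__(self):
        super().__init__()
        # Keep only the convolutional part of VGG-16 (output stride 32).
        self.backbone = vgg16(weights="DEFAULT").features
        # 1x1 convolutions replace the fully connected classifier.
        self.head = nn.Sequential(
            nn.Conv2d(512, 256, kernel_size=1), nn.ReLU(inplace=True),
            nn.Conv2d(256, 1, kernel_size=1),
        )

    def forward(self, x):
        h, w = x.shape[-2:]
        score = self.head(self.backbone(x))
        # Upsample the coarse score map back to input resolution.
        score = F.interpolate(score, size=(h, w), mode="bilinear", align_corners=False)
        return torch.sigmoid(score)  # per-pixel probability of "text block"

model = TextBlockFCN().eval()
with torch.no_grad():
    prob_map = model(torch.rand(1, 3, 512, 512))  # shape (1, 1, 512, 512)
```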


2021 ◽  
Author(s):  
Khalil Boukthir ◽  
Abdulrahman M. Qahtani ◽  
Omar Almutiry ◽  
Habib Dhahri ◽  
Adel Alimi

- A novel approach to reduced annotation based on Deep Active Learning is presented for Arabic text detection in natural scene images.
- A new Arabic text image dataset (7k images), named TSVD, collected using the Google Street View service.
- A new semi-automatic method for generating natural scene text images from the streets.
- Training samples are reduced to 1/5 of the original training size on average.
- Much less training data is needed to achieve a better Dice index: 0.84.
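For illustration, a generic uncertainty-based deep active learning loop of the kind these highlights describe is sketched below; the model interface (predict_proba, fit), the entropy criterion, and the per-round budget are assumptions, not the paper's exact procedure.

```python
# Illustrative sketch: a generic uncertainty-based deep active learning loop
# for reducing annotation effort. The selection criterion and budget are
# assumptions, not the paper's method.
import numpy as np

def active_learning_loop(model, unlabeled_images, label_fn, rounds=5, budget=100):
    """Iteratively pick the most uncertain images for manual annotation.

    model            : hypothetical wrapper exposing predict_proba(image) -> per-pixel
                       text probability map and fit(images, masks) for (re)training
    unlabeled_images : list of images not yet annotated
    label_fn         : callable returning a ground-truth mask (the human annotator)
    """
    labeled_images, labeled_masks = [], []
    pool = list(unlabeled_images)

    for _ in range(rounds):
        # Score each pooled image by mean prediction entropy (uncertainty).
        scores = []
        for img in pool:
            p = np.clip(model.predict_proba(img), 1e-6, 1 - 1e-6)
            entropy = -(p * np.log(p) + (1 - p) * np.log(1 - p))
            scores.append(entropy.mean())

        # Send only the `budget` most uncertain images to the annotator.
        ranked = np.argsort(scores)[::-1][:budget]
        labeled_images += [pool[i] for i in ranked]
        labeled_masks += [label_fn(pool[i]) for i in ranked]
        pool = [img for i, img in enumerate(pool) if i not in set(ranked)]

        # Retrain on the growing labeled set.
        model.fit(labeled_images, labeled_masks)
    return model, labeled_images
```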


