Text detection and script identification in natural scene images using deep learning

Text in scene images can provide useful and vital information for content-based image analysis. Therefore, text detection and script identification in images are an important task. In this paper, we propose a new method for text detection in natural scene images, particularly for Arabic text, based on a bottom-up approach where four principal steps can be highlighted. The detection of extremely stable and homogeneous regions of interest (ROIs) is based on the Color Stability and Homogeneity Regions (CSHR) proposed technique. These regions are then labeled as textual or non-textual ROI. This identification is based on a structural approach. The textual ROIs are grouped to constitute zones according to spatial relations between them. Finally, the textual or non-textual nature of the constituted zones is refined. This last identification is based on handcrafted features and on features built from a Convolutional Neural Network (CNN) after learning. The proposed method was evaluated on the databases used for text detection in natural scene images: the competitions organized in 2017 edition of the International Conference on Document Analysis and Recognition (ICDAR2017), the Urdu-text database and our Natural Scene Image Database for Arabic Text detection (NSIDAT) database. The obtained experimental results seem to be interesting.

Download Full-text

Mining discriminative patches for script identification in natural scene images

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-200260 ◽

2021 ◽

Vol 40 (1) ◽

pp. 551-563

Author(s):

Liqiong Lu ◽

Dong Wu ◽

Ziwei Tang ◽

Yaohua Yi ◽

Faliang Huang

Keyword(s):

Neural Networks ◽

Experimental Results ◽

The Other ◽

Natural Scene ◽

Fixed Size ◽

Script Identification ◽

Aspect Ratios ◽

Novel Approach ◽

Public Datasets ◽

Natural Scene Images

This paper focuses on script identification in natural scene images. Traditional CNNs (Convolution Neural Networks) cannot solve this problem perfectly for two reasons: one is the arbitrary aspect ratios of scene images which bring much difficulty to traditional CNNs with a fixed size image as the input. And the other is that some scripts with minor differences are easily confused because they share a subset of characters with the same shapes. We propose a novel approach combing Score CNN, Attention CNN and patches. Attention CNN is utilized to determine whether a patch is a discriminative patch and calculate the contribution weight of the discriminative patch to script identification of the whole image. Score CNN uses a discriminative patch as input and predict the score of each script type. Firstly patches with the same size are extracted from the scene images. Secondly these patches are used as inputs to Score CNN and Attention CNN to train two patch-level classifiers. Finally, the results of multiple discriminative patches extracted from the same image via the above two classifiers are fused to obtain the script type of this image. Using patches with the same size as inputs to CNN can avoid the problems caused by arbitrary aspect ratios of scene images. The trained classifiers can mine discriminative patches to accurately identify some confusing scripts. The experimental results show the good performance of our approach on four public datasets.

Download Full-text

Devanagari Text Detection From Natural Scene Images

International Journal of Computer Vision and Image Processing ◽

10.4018/ijcvip.2020070104 ◽

2020 ◽

Vol 10 (3) ◽

pp. 44-59

Author(s):

Sankirti Sandeep Shiravale ◽

R. Jayadevan ◽

Sanjeev S. Sannakki

Keyword(s):

Edge Detection ◽

Image Understanding ◽

Text Detection ◽

Experimental Results ◽

Combined Approach ◽

Natural Scene ◽

Light Conditions ◽

The Individual ◽

Natural Scene Images ◽

Better Than

Text present in a camera captured scene images is semantically rich and can be used for image understanding. Automatic detection, extraction, and recognition of text are crucial in image understanding applications. Text detection from natural scene images is a tedious task due to complex background, uneven light conditions, multi-coloured and multi-sized font. Two techniques, namely ‘edge detection' and ‘colour-based clustering', are combined in this paper to detect text in scene images. Region properties are used for elimination of falsely generated annotations. A dataset of 1250 images is created and used for experimentation. Experimental results show that the combined approach performs better than the individual approaches.

Download Full-text

Integrated Method for Text Detection in Natural Scene Images

KSII Transactions on Internet and Information Systems ◽

10.3837/tiis.2016.11.021 ◽

2016 ◽

Keyword(s):

Text Detection ◽

Natural Scene ◽

Integrated Method ◽

Natural Scene Images

Download Full-text

Text Detection Based on Text Shape Feature Analysis with Intelligent Grouping in Natural Scene Images

Mathematical Modeling and Computational Tools - Springer Proceedings in Mathematics & Statistics ◽

10.1007/978-981-15-3615-1_33 ◽

2020 ◽

pp. 467-479

Author(s):

D. Kavitha ◽

V. Radha

Keyword(s):

Text Detection ◽

Feature Analysis ◽

Shape Feature ◽

Natural Scene ◽

Natural Scene Images

Download Full-text

Improved localization accuracy by LocNet for Faster R-CNN based text detection in natural scene images

Pattern Recognition ◽

10.1016/j.patcog.2019.106986 ◽

2019 ◽

Vol 96 ◽

pp. 106986 ◽

Cited By ~ 9

Author(s):

Zhuoyao Zhong ◽

Lei Sun ◽

Qiang Huo

Keyword(s):

Text Detection ◽

Natural Scene ◽

Localization Accuracy ◽

Natural Scene Images

Download Full-text

A robust arbitrary text detection system for natural scene images

Expert Systems with Applications ◽

10.1016/j.eswa.2014.07.008 ◽

2014 ◽

Vol 41 (18) ◽

pp. 8027-8048 ◽

Cited By ~ 96

Author(s):

Anhar Risnumawan ◽

Palaiahankote Shivakumara ◽

Chee Seng Chan ◽

Chew Lim Tan

Keyword(s):

Detection System ◽

Text Detection ◽

Natural Scene ◽

Natural Scene Images

Download Full-text

Text Detection in Natural Scene Images with Stroke Width Clustering and Superpixel

Advances in Multimedia Information Processing – PCM 2014 - Lecture Notes in Computer Science ◽

10.1007/978-3-319-13168-9_13 ◽

2014 ◽

pp. 123-132 ◽

Cited By ~ 3

Author(s):

Shuang Liu ◽

Yu Zhou ◽

Yongzheng Zhang ◽

Yipeng Wang ◽

Weiyao Lin

Keyword(s):

Text Detection ◽

Natural Scene ◽

Stroke Width ◽

Natural Scene Images

Download Full-text

Multilingual Scene Text Detection Using Gradient Morphology

International Journal of Computer Vision and Image Processing ◽

10.4018/ijcvip.2020070103 ◽

2020 ◽

Vol 10 (3) ◽

pp. 31-43 ◽

Cited By ~ 2

Author(s):

Dibyajyoti Dhar ◽

Neelotpal Chakraborty ◽

Sayan Choudhury ◽

Ashis Paul ◽

Ayatullah Faruk Mollah ◽

...

Keyword(s):

State Of The Art ◽

High Sensitivity ◽

Text Detection ◽

Interesting Problem ◽

Natural Scene ◽

The Past ◽

Scene Text Detection ◽

Scene Text ◽

Multiple Languages ◽

Natural Scene Images

Text detection in natural scene images is an interesting problem in the field of information retrieval. Several methods have been proposed over the past few decades for scene text detection. However, the robustness and efficiency of these methods are downgraded due to high sensitivity towards various complexities of an image. Also, in multi-lingual environment where texts may occur in multiple languages, a method may not be suitable for detecting scene texts in certain languages. To counter these challenges, a gradient morphology-based method is proposed in this paper that proves to be robust against image complexities and efficiently detects scene texts irrespective of their languages. The method is validated using low quality images from standard multi-lingual datasets like MSRA-TD500 and MLe2e. The performance of the method is compared with that of some state-of-the-art methods, and comparably better results are observed.

Download Full-text