Smart Eye: A Handheld Device Based Application for Text Detection and Speech Conversion

Natural scene text is widely encountered in everyday life and has many important multimedia applications. It typically varies greatly in font and language, and suffers from low resolution, occlusion, and complex backgrounds. This paper proposes Smart Eye, an Android-based application that works offline, robustly detects text in natural images in real time, and converts the detected text to speech to assist people with visual impairments. Spoken input is also converted to text, which can aid people with hearing impairments.

Natural Scene Text Detection Based on Multi-Channel FASText

Proceedings of the 2017 2nd International Conference on Automatic Control and Information Engineering (ICACIE 2017) ◽

10.2991/icacie-17.2017.4 ◽

2017 ◽

Author(s):

Chenfeng Guo ◽

Juhua Liu

Keyword(s):

Text Detection ◽

Natural Scene ◽

Scene Text Detection ◽

Scene Text

Real-Time Scene Text Detection Based on Stroke Model

2014 22nd International Conference on Pattern Recognition ◽

10.1109/icpr.2014.537 ◽

2014 ◽

Cited By ~ 6

Author(s):

Yi Liu ◽

Dongming Zhang ◽

Yongdong Zhang ◽

Shouxun Lin

Keyword(s):

Real Time ◽

Text Detection ◽

Stroke Model ◽

Scene Text Detection ◽

Scene Text

Natural scene text detection with MC–MR candidate extraction and coarse-to-fine filtering

Neurocomputing ◽

10.1016/j.neucom.2017.03.078 ◽

2017 ◽

Vol 260 ◽

pp. 112-122 ◽

Cited By ~ 17

Author(s):

Chunna Tian ◽

Yong Xia ◽

Xiangnan Zhang ◽

Xinbo Gao

Keyword(s):

Text Detection ◽

Natural Scene ◽

Scene Text Detection ◽

Scene Text ◽

Coarse To Fine

Natural scene text detection and recognition with a three-stage local phase-based algorithm

Applications of Digital Image Processing XLI ◽

10.1117/12.2320646 ◽

2018 ◽

Author(s):

Julia Diaz-Escobar ◽

Vitaly Kober

Keyword(s):

Text Detection ◽

Local Phase ◽

Natural Scene ◽

Scene Text Detection ◽

Scene Text ◽

Detection And Recognition

Multilingual Scene Text Detection Using Gradient Morphology

International Journal of Computer Vision and Image Processing ◽

10.4018/ijcvip.2020070103 ◽

2020 ◽

Vol 10 (3) ◽

pp. 31-43 ◽

Cited By ~ 2

Author(s):

Dibyajyoti Dhar ◽

Neelotpal Chakraborty ◽

Sayan Choudhury ◽

Ashis Paul ◽

Ayatullah Faruk Mollah ◽

...

Keyword(s):

State Of The Art ◽

High Sensitivity ◽

Text Detection ◽

Interesting Problem ◽

Natural Scene ◽

The Past ◽

Scene Text Detection ◽

Scene Text ◽

Multiple Languages ◽

Natural Scene Images

Text detection in natural scene images is an interesting problem in the field of information retrieval. Several methods have been proposed over the past few decades for scene text detection. However, the robustness and efficiency of these methods degrade because they are highly sensitive to various image complexities. Also, in a multilingual environment where text may occur in multiple languages, a method may not be suitable for detecting scene text in certain languages. To counter these challenges, this paper proposes a gradient morphology-based method that proves robust against image complexities and efficiently detects scene text irrespective of language. The method is validated using low-quality images from standard multilingual datasets such as MSRA-TD500 and MLe2e. Its performance is compared with that of some state-of-the-art methods, and comparatively better results are observed.
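
The abstract does not spell out the method's steps, so the following is only a minimal sketch of the generic morphological gradient operator (dilation minus erosion) that gradient morphology approaches build on. The function names, the square structuring element, and the threshold value are illustrative assumptions, not the authors' implementation.

```python
def morphological_gradient(img, radius=1):
    """Dilation minus erosion of a 2D grayscale image given as a list of lists.

    Uses a square (2*radius+1) structuring element, clipped at the border.
    This is the textbook operator, not the specific method of the paper.
    """
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Gather the neighbourhood around (y, x), staying inside the image.
            window = [
                img[yy][xx]
                for yy in range(max(0, y - radius), min(h, y + radius + 1))
                for xx in range(max(0, x - radius), min(w, x + radius + 1))
            ]
            # Local maximum is the dilation, local minimum is the erosion.
            out[y][x] = max(window) - min(window)
    return out


def text_candidate_mask(img, threshold=50, radius=1):
    """Mark pixels with strong local contrast as text candidates.

    The threshold is a hypothetical value chosen for illustration only.
    """
    grad = morphological_gradient(img, radius)
    return [[1 if g >= threshold else 0 for g in row] for row in grad]


# A dark 6x6 image with a bright 2x2 block standing in for a text stroke.
image = [[0] * 6 for _ in range(6)]
for y in (2, 3):
    for x in (2, 3):
        image[y][x] = 200

gradient = morphological_gradient(image)
mask = text_candidate_mask(image)
```

Because the operator responds to local contrast rather than to the shape of particular glyphs, it is a plausible starting point for script-independent detection. A real pipeline would still need component grouping and filtering stages that the abstract does not describe.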

Natural Scene Text Detection Based On Multi-level Fusion Proposal Network

IOP Conference Series Materials Science and Engineering ◽

10.1088/1757-899x/790/1/012051 ◽

2020 ◽

Vol 790 ◽

pp. 012051

Author(s):

Tong Li ◽

Wanggen Li ◽

Nannan Zhu ◽

Xuecheng Gong ◽

Jiajia Chen

Keyword(s):

Text Detection ◽

Natural Scene ◽

Scene Text Detection ◽

Scene Text ◽

Multi Level ◽

Level Fusion

Multi-Oriented Real-Time Arabic Scene Text Detection with Deep Fully Convolutional Networks

2019 IEEE/ACS 16th International Conference on Computer Systems and Applications (AICCSA) ◽

10.1109/aiccsa47632.2019.9035340 ◽

2019 ◽

Cited By ~ 1

Author(s):

M. Saifeddine Hadj Sassi ◽

Ines Beltaief ◽

Manel Zekri ◽

Sadok Ben Yahia

Keyword(s):

Real Time ◽

Text Detection ◽

Convolutional Networks ◽

Fully Convolutional Networks ◽

Scene Text Detection ◽

Scene Text

Natural scene text detection with multi-layer segmentation and higher order conditional random field based analysis

Pattern Recognition Letters ◽

10.1016/j.patrec.2015.04.005 ◽

2015 ◽

Vol 60-61 ◽

pp. 41-47 ◽

Cited By ~ 14

Author(s):

Xiaobing Wang ◽

Yonghong Song ◽

Yuanlin Zhang ◽

Jingmin Xin

Keyword(s):

Random Field ◽

Conditional Random Field ◽

Higher Order ◽

Text Detection ◽

Natural Scene ◽

Scene Text Detection ◽

Scene Text ◽

Layer Segmentation

Real-time Arabic scene text detection using fully convolutional neural networks

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v11i2.pp1634-1640 ◽

2021 ◽

Vol 11 (2) ◽

pp. 1634

Author(s):

Rajae Moumen ◽

Raddouane Chiheb ◽

Rdouan Faizi

Keyword(s):

Real Time ◽

Data Augmentation ◽

State Of The Art ◽

Arabic Language ◽

Text Detection ◽

The State ◽

Convolutional Network ◽

Fully Convolutional Network ◽

Scene Text Detection ◽

Scene Text

This research proposes a fully convolutional approach to real-time scene text detection for the Arabic language. Text detection is performed using a two-step multi-scale approach. The first step uses a lightweight fully convolutional network, TextBlockDetector FCN, an adaptation of VGG-16, to eliminate non-textual elements, localize wide-scale text, and estimate text scale. The second step determines the narrow scale range of text using a fully convolutional network for maximum performance. To evaluate the system, we compare the results of the framework with those obtained from a single VGG-16 fully deployed for one-shot text detection, as well as with previous state-of-the-art results. For training and testing, we created a dataset of 575 manually processed images and used data augmentation to enrich the training process. The system scores a precision of 0.651 vs 0.64 in the state-of-the-art and an FPS of 24.3 vs 31.7 for a fully deployed VGG-16.
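
The abstract reports precision and frames per second but does not state the matching protocol behind them. As a hedged illustration of how such figures are commonly computed, the sketch below counts a detection as correct when it overlaps an unmatched ground-truth box with intersection over union of at least 0.5. The box format `(x1, y1, x2, y2)`, the 0.5 threshold, and all function names are assumptions made for this example.

```python
def iou(a, b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def precision(detections, ground_truth, iou_threshold=0.5):
    """Fraction of detections matched to a distinct ground-truth box.

    Greedy one-to-one matching; the threshold is an assumed convention,
    not a value taken from the paper.
    """
    matched = set()
    true_positives = 0
    for det in detections:
        best_idx, best_iou = None, 0.0
        for i, gt in enumerate(ground_truth):
            if i in matched:
                continue
            overlap = iou(det, gt)
            if overlap > best_iou:
                best_idx, best_iou = i, overlap
        if best_idx is not None and best_iou >= iou_threshold:
            matched.add(best_idx)
            true_positives += 1
    return true_positives / len(detections) if detections else 0.0


def frames_per_second(n_frames, elapsed_seconds):
    """Throughput: number of processed frames divided by wall-clock time."""
    return n_frames / elapsed_seconds


# One correct detection and one false alarm against a single ground-truth box.
dets = [(0, 0, 10, 10), (100, 100, 110, 110)]
gts = [(0, 0, 10, 10)]
p = precision(dets, gts)
```

Precision alone does not penalize missed text regions, so it is usually read together with recall, which the abstract does not report.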

Reduced Annotation Based on Deep Active Learning for Arabic Text Detection in Natural Scene Images

10.36227/techrxiv.17327963 ◽

2021 ◽

Author(s):

Khalil Boukthir ◽

Abdulrahman M. Qahtani ◽

Omar Almutiry ◽

Habib Dhahri ◽

Adel Alimi

Keyword(s):

Active Learning ◽

Text Detection ◽

Training Data ◽

Arabic Text ◽

Natural Scene ◽

Novel Approach ◽

Training Samples ◽

Scene Text ◽

Text Images ◽

Natural Scene Images

- A novel approach to reducing annotation, based on Deep Active Learning, is presented for Arabic text detection in natural scene images.
- A new Arabic text image dataset (7k images), named TSVD, is built using the Google Street View service.
- A new semi-automatic method is proposed for generating natural scene text images from the streets.
- Training samples are reduced to 1/5 of the original training size on average.
- Much less training data is needed to achieve a better Dice index: 0.84.
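
For readers unfamiliar with the Dice index quoted above, it measures the overlap between a predicted mask and a reference mask as 2|A∩B| / (|A| + |B|). The sketch below computes it for binary masks stored as lists of lists. The function name and the convention of returning 1.0 for two empty masks are assumptions for illustration, not details taken from the paper.

```python
def dice_index(pred, target):
    """Dice index 2*|A & B| / (|A| + |B|) for two same-sized binary masks."""
    intersection = pred_count = target_count = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p:
                pred_count += 1
            if t:
                target_count += 1
            if p and t:
                intersection += 1
    total = pred_count + target_count
    # Assumed convention: two empty masks agree perfectly.
    return 2 * intersection / total if total else 1.0


# Prediction marks two pixels, the reference marks one of them.
predicted = [[1, 1, 0],
             [0, 0, 0]]
reference = [[1, 0, 0],
             [0, 0, 0]]
score = dice_index(predicted, reference)
```

A value of 1.0 means the masks coincide exactly and 0.0 means they share no pixels.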
