Text detection in natural images based on multi-scale edge detection and classification

Natural scene text is widely encountered in everyday life and has many important multimedia applications. It typically exhibits large variation in fonts and languages and suffers from low resolution, occlusion, and complex backgrounds. Smart Eye, an Android-based application that works offline, is proposed here for text detection: it robustly detects text in natural images in real time and converts the text in the image to speech, which can assist people with visual impairments. Speech is also converted to text, which can aid people with hearing impairments.

Hi-Fi: Hierarchical Feature Integration for Skeleton Detection

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽ 10.24963/ijcai.2018/166 ◽ 2018

Cited By ~ 13

Author(s): Kai Zhao ◽ Wei Shen ◽ Shanghua Gao ◽ Dandan Li ◽ Ming-Ming Cheng

Keyword(s): Performance Improvement ◽ State Of The Art ◽ Natural Images ◽ Feature Integration ◽ Detection Problem ◽ Multi Scale ◽ Integration Mechanism ◽ Object Parts ◽ High Level ◽ Different Levels

In natural images, the scale (thickness) of object skeletons may vary dramatically among objects and object parts. Thus, robust skeleton detection requires powerful multi-scale feature integration. To address the object skeleton detection problem, we present a new convolutional neural network (CNN) architecture that introduces a novel hierarchical feature integration mechanism, named Hi-Fi. The proposed CNN-based approach intrinsically captures high-level semantics from deeper layers, as well as low-level details from shallower layers. By hierarchically integrating different CNN feature levels with bidirectional guidance, our approach (1) enables mutual refinement across features of different levels, and (2) captures both rich object context and high-resolution details. Experimental results show that our method significantly outperforms state-of-the-art methods at fusing features from very different scales, as evidenced by a considerable performance improvement on several benchmarks.
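The idea of fusing feature levels hierarchically can be illustrated with a toy sketch. This is not the authors' network: it assumes 1-D feature lists in place of CNN feature maps, nearest-neighbour upsampling, and plain averaging in place of learned convolutions, and it does not model the bidirectional guidance described in the abstract. All function names (`upsample`, `fuse_pair`, `hierarchical_fuse`) are hypothetical.

```python
def upsample(feature, factor):
    """Nearest-neighbour upsampling of a 1-D feature list by an integer factor."""
    return [value for value in feature for _ in range(factor)]


def fuse_pair(fine, coarse):
    """Fuse two levels: upsample the coarser one, then average element-wise.

    Averaging is an assumption made for illustration; a real network
    would use learned layers here.
    """
    if len(fine) % len(coarse) != 0:
        raise ValueError("fine length must be a multiple of coarse length")
    factor = len(fine) // len(coarse)
    coarse_up = upsample(coarse, factor)
    return [(f + c) / 2.0 for f, c in zip(fine, coarse_up)]


def hierarchical_fuse(levels):
    """Repeatedly fuse adjacent levels (finest first) until one map remains."""
    if not levels:
        raise ValueError("need at least one feature level")
    current = list(levels)
    while len(current) > 1:
        # Each round fuses every adjacent pair, shrinking the list by one.
        current = [
            fuse_pair(current[i], current[i + 1])
            for i in range(len(current) - 1)
        ]
    return current[0]


# Three made-up levels, ordered from shallow (fine) to deep (coarse).
levels = [
    [1.0, 1.0, 0.0, 0.0, 1.0, 1.0, 0.0, 0.0],
    [1.0, 0.0, 1.0, 0.0],
    [1.0, 0.0],
]
fused = hierarchical_fuse(levels)
```

Fusing adjacent levels first, instead of merging all levels in one step, means each intermediate result only combines features of similar scale, which is the intuition the abstract attributes to hierarchical integration.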

An implicit Markov random field model for the multi-scale oriented representations of natural images

2009 IEEE Conference on Computer Vision and Pattern Recognition ◽ 10.1109/cvprw.2009.5206797 ◽ 2009

Author(s): Siwei Lyu

Keyword(s): Random Field ◽ Markov Random Field ◽ Field Model ◽ Natural Images ◽ Random Field Model ◽ Markov Random Field Model ◽ Multi Scale ◽ Markov Random

Multi Orientation Text Detection in Natural Imagery

International Journal of Computer Vision and Image Processing ◽ 10.4018/ijcvip.2018100104 ◽ 2018 ◽ Vol 8 (4) ◽ pp. 41-56

Author(s): Deepak Kumar ◽ Ramandeep Singh

Keyword(s): Performance Metrics ◽ Research Work ◽ Text Detection ◽ Natural Images ◽ Hard Copy ◽ Future Perspective ◽ Depth Study ◽ Benchmark Datasets ◽ And Performance ◽ Curved Text

Constant advancement in digital technology is swiftly shifting the focus of text detection from hard-copy images to natural images. An in-depth study of previous research reveals that, although much work has been done on text detection and recognition in natural scene images, most researchers have limited their surveys to horizontal or near-horizontal text. Those surveys touch on multi-orientation text detection, but curved text detection in natural images has escaped their attention. This calls for a study that covers horizontal, near-horizontal, multi-orientation, and curved text in a single place. To that end, the present study focuses on the fundamentals, existing challenges, and proven algorithms for text detection in natural images. The authors also discuss future perspectives on recent advances in text detection in natural images, along with various benchmark datasets and performance metrics.
