Text Detection in Natural Images Using Localized Stroke Width Transform

Author(s):  
Wenyan Dong ◽  
Zhouhui Lian ◽  
Yingmin Tang ◽  
Jianguo Xiao
Author(s):  
Anirban Mukhopadhyay ◽  
Sourav Kumar ◽  
Souvik Roy Chowdhury ◽  
Neelotpal Chakraborty ◽  
Ayatullah Faruk Mollah ◽  
...  

The main purpose of scene text recognition is to detect texts in a given image. The problem of text detection and recognition in such images has gained great attention in recent years due to rising demand of several applications like visual based applications, multimedia and content-based retrieval. Due to low accuracies of existing scene text detection methods, an improved pipeline is developed for text localizing task. First, candidate text regions are generated using Maximally Stable Extremal Region and Stroke Width Transform methods that capture true positives along with many false positives. A One Class Classifier is trained to label the candidate regions obtained, as text or non-text, which in this case is suitable as non-text class cannot be adequately represented to train a binary classifier. The one class classifier is trained with some popular feature descriptors like Histogram of Oriented Gradients, Grey Level Co-Occurrence Matrix, Discrete Cosine Transform and Gabor filter. Experimental results show high recall for text containing regions and reducing false positives.


Sign in / Sign up

Export Citation Format

Share Document