scholarly journals Urdu text in natural scene images: a new dataset and preliminary text detection

2021 ◽  
Vol 7 ◽  
pp. e717
Author(s):  
Hazrat Ali ◽  
Khalid Iqbal ◽  
Ghulam Mujtaba ◽  
Ahmad Fayyaz ◽  
Mohammad Farhad Bulbul ◽  
...  

Text detection in natural scene images for content analysis is an interesting task. The research community has seen some great developments for English/Mandarin text detection. However, Urdu text extraction in natural scene images is a task not well addressed. In this work, firstly, a new dataset is introduced for Urdu text in natural scene images. The dataset comprises of 500 standalone images acquired from real scenes. Secondly, the channel enhanced Maximally Stable Extremal Region (MSER) method is applied to extract Urdu text regions as candidates in an image. Two-stage filtering mechanism is applied to eliminate non-candidate regions. In the first stage, text and noise are classified based on their geometric properties. In the second stage, a support vector machine classifier is trained to discard non-text candidate regions. After this, text candidate regions are linked using centroid-based vertical and horizontal distances. Text lines are further analyzed by a different classifier based on HOG features to remove non-text regions. Extensive experimentation is performed on the locally developed dataset to evaluate the performance. The experimental results show good performance on test set images. The dataset will be made available for research use. To the best of our knowledge, the work is the first of its kind for the Urdu language and would provide a good dataset for free research use and serve as a baseline performance on the task of Urdu text extraction.

2018 ◽  
Vol 7 (2.12) ◽  
pp. 29
Author(s):  
Jae Ho Yang ◽  
Gang Seong Lee ◽  
Young Pyo Hong ◽  
Sang Hun Lee

Background/Objectives: In this paper, we propose a hybrid scene-detection method using an edge and textural analysis in natural scene images, and finally, we detect the text regions by removing the non-text regions through a pattern analysis of each region.Methods/Statistical analysis: The proposed algorithm is divided into the pre-processing stage and the extraction processing stage to perform the text detection. The lost texts that are improved through a histogram equalization for the minimization of the loss of the text parts that is due to light exposure are detected before the edge detection. After that, the edge is detected using the Canny operator. The detected edge is obtained in the step of applying the SWT algorithm to detect the text candidate regions. The extraction processing step is the step of removing the noise region that is detected by the pixel analysis of the SWT algorithm, and it analyzes the pattern of the text regions and then removes the non-text regions to finally detect the text regions. For the quantitative comparison of the proposed algorithm, our results are compared with the ground-truth image using the precision, recall, and F-measure.Findings: One of the existing text-detection algorithms, the edge-based method, is problematic, as, in addition to the text, the complex backgrounds and textures are detected as the edges in natural scene images. The connected component-based method is also problematic, as the non-text region is included in the text region in the process of finding the connection component.Improvements/Applications: The proposed method shows an effective text-detection result regardless of the light exposure in natural scene images compared with the conventional text-detection algorithm.  


Of late, the rapid development in the technology and multimedia capability in digital cameras and mobile devices has led to ever increasing number of images or multi-media data to the digital world. Particularly, in natural scene images, the text content provides explicit information to understand the semantics of images. Therefore, a system developed for extracting and recognizing texts accurately from natural scene images, in real-time, has significant relevance to numerous applications such as, assistive technology for people with vision impairment, tourist with language barrier, vehicle number plate detection, street signs, advertisement bill boards, robotics, etc. The extraction of the texts from natural scene images is a formidable task due to large variations in character fonts, styles, sizes, text orientations, presence of complex backgrounds and varying light conditions. The main focus of this research paper is to propose a novel hybrid approach for automatic detection, localization, extraction and recognition of text in natural scene images with cluttered background. Firstly, image regions with text are detected using edge features (GLCM) extracted from Contourlet transformed image and SVM (Support Vector Machine) classifier. Secondly, horizontal projection is applied on text regions for segmenting lines and vertical projection is applied on each text line for segmenting characters. The proposed method for text extraction has produced the precision, recall, F-Score and accuracy of 98.50%, 90.85.62%, 95.00%, and 89.90%, respectively. And, these results prove that the proposed method is efficient. Further, the so extracted characters are processed for recognition using contourlet transform and Probabilistic Neural Network (PNN) classifier. The computed features are moment invariants. Only the English script is considered for the experimentation. The proposed character recognition method has accuracy of 79.07%, which is higher in comparison to accuracy of 75.15% obtained by KNN (K-Nearest Neighbors) classifier


Author(s):  
Sankirti Sandeep Shiravale ◽  
R. Jayadevan ◽  
Sanjeev S. Sannakki

Text present in a camera captured scene images is semantically rich and can be used for image understanding. Automatic detection, extraction, and recognition of text are crucial in image understanding applications. Text detection from natural scene images is a tedious task due to complex background, uneven light conditions, multi-coloured and multi-sized font. Two techniques, namely ‘edge detection' and ‘colour-based clustering', are combined in this paper to detect text in scene images. Region properties are used for elimination of falsely generated annotations. A dataset of 1250 images is created and used for experimentation. Experimental results show that the combined approach performs better than the individual approaches.


2019 ◽  
Vol 2019 (8) ◽  
pp. 5397-5406
Author(s):  
Angia Venkatesan Karpagam ◽  
Mohan Manikandan

2014 ◽  
Vol 41 (18) ◽  
pp. 8027-8048 ◽  
Author(s):  
Anhar Risnumawan ◽  
Palaiahankote Shivakumara ◽  
Chee Seng Chan ◽  
Chew Lim Tan

Sign in / Sign up

Export Citation Format

Share Document