Text detection and recognition in natural scenes and consumer videos

Author(s):  
Arpit Jain ◽  
Xujun Peng ◽  
Xiaodan Zhuang ◽  
Pradeep Natarajan ◽  
Huaigu Cao
2021 ◽  
pp. 198-212
Author(s):  
Aline Geovanna Soares ◽  
Byron Leite Dantas Bezerra ◽  
Estanislau Baptista Lima

Author(s):  
Fazliddin Makhmudov ◽  
Mukhriddin Mukhiddinov ◽  
Akmalbek Abdusalomov ◽  
Kuldoshbay Avazov ◽  
Utkir Khamdamov ◽  
...  

Methods for text detection and recognition in images of natural scenes have become an active research topic in computer vision and have obtained encouraging achievements over several benchmarks. In this paper, we introduce a robust yet simple pipeline that produces accurate and fast text detection and recognition for the Uzbek language in natural scene images using a fully convolutional network and the Tesseract OCR engine. First, the text detection step quickly predicts text in random orientations in full-color images with a single fully convolutional neural network, discarding redundant intermediate stages. Then, the text recognition step recognizes the Uzbek language, including both the Latin and Cyrillic alphabets, using a trained Tesseract OCR engine. Finally, the recognized text can be pronounced using the Uzbek language text-to-speech synthesizer. The proposed method was tested on the ICDAR 2013, ICDAR 2015 and MSRA-TD500 datasets, and it showed an advantage in efficiently detecting and recognizing text from natural scene images for assisting the visually impaired.


2016 ◽  
Vol 2 (9) ◽  
Author(s):  
Asit Kumar ◽  
Sumit Kumar

Text plays an significant role in day-to-day life because of its dissimilarities in text size, font, style, orientation and alignment as well as composite background and rich information, as a consequence automatic text detection in natural scenes has several attractive applications. Though, detecting and recognizing such text is all the time a challenging issue. Several text extraction techniques grounded on edge detection, connected component analysis, morphological operators, wavelet transform, texture features, neural network etc. have been established. This paper contributes comparative analysis of different technique which provides efficient performance.


Sign in / Sign up

Export Citation Format

Share Document