Analysis of Text Identification Techniques Using Scene Text and Optical Character Recognition

In today's era, data in digitalized form is needed for faster processing and performing of all tasks. The best way to digitalize the documents is by extracting the text from them. This work of text extraction can be performed by various text identification tasks such as scene text recognition, optical character recognition, handwriting recognition, and much more. This paper presents, reviews, and analyses recent research expansion in the area of optical character recognition and scene text recognition based on various existing models such as convolutional neural network, long short-term memory, cognitive reading for image processing, maximally stable extreme regions, stroke width transformation, and achieved remarkable results up to 90.34% of F-score with benchmark datasets such as ICDAR 2013, ICDAR 2019, IIIT5k. The researchers have done outstanding work in the text recognition field. Yet, improvement in text detection in low-quality image performance is required, as text identification should not be limited to the input quality of the image.

Download Full-text

SCENE TEXT RECOGNITION BY USING EE-MSER AND OPTICAL CHARACTER RECOGNITION FOR NATURAL IMAGES

International Journal of Advance Engineering and Research Development ◽

10.21090/ijaerd.021219 ◽

2015 ◽

Vol 2 (12) ◽

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Natural Images ◽

Text Recognition ◽

Optical Character ◽

Scene Text ◽

Scene Text Recognition

Download Full-text

Multi-granularity Deep Local Representations for Irregular Scene Text Recognition

ACM/IMS Transactions on Data Science ◽

10.1145/3446971 ◽

2021 ◽

Vol 2 (2) ◽

pp. 1-18

Author(s):

Hongchao Gao ◽

Yujia Li ◽

Jiao Dai ◽

Xi Wang ◽

Jizhong Han ◽

...

Keyword(s):

State Of The Art ◽

Visual Representation ◽

Text Recognition ◽

Natural Scene ◽

Attention Network ◽

Training Time ◽

Scene Text ◽

Benchmark Datasets ◽

Local Representations ◽

Scene Text Recognition

Recognizing irregular text from natural scene images is challenging due to the unconstrained appearance of text, such as curvature, orientation, and distortion. Recent recognition networks regard this task as a text sequence labeling problem and most networks capture the sequence only from a single-granularity visual representation, which to some extent limits the performance of recognition. In this article, we propose a hierarchical attention network to capture multi-granularity deep local representations for recognizing irregular scene text. It consists of several hierarchical attention blocks, and each block contains a Local Visual Representation Module (LVRM) and a Decoder Module (DM). Based on the hierarchical attention network, we propose a scene text recognition network. The extensive experiments show that our proposed network achieves the state-of-the-art performance on several benchmark datasets including IIIT-5K, SVT, CUTE, SVT-Perspective, and ICDAR datasets under shorter training time.

Download Full-text

Handwriting Recognition System Using Optical Character Recognition

International Journal of Scientific Research in Computer Sciences and Engineering ◽

10.26438/ijsrcse/v6i3.1821 ◽

2018 ◽

Vol 6 (3) ◽

pp. 18-21

Author(s):

Priti Gangania ◽

Sowmya Mishra ◽

Shreshtha Garg ◽

Sonam Agarwal

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Handwriting Recognition ◽

Recognition System ◽

Optical Character

Download Full-text

Optical Character Recognition of Indian Language Manuscripts using Convolutional Neural Networks

Design Engineering ◽

10.17762/de.v2021i3.7789 ◽

2021 ◽

pp. 894-911

Author(s):

Bhavesh Kataria, Dr. Harikrishna B. Jethva

Keyword(s):

Neural Networks ◽

Character Recognition ◽

Optical Character Recognition ◽

Short Term Memory ◽

Short Term ◽

Indian Language ◽

Term Memory ◽

Text Document ◽

Optical Character ◽

Long Short Term Memory

India's constitution has 22 languages written in 17 different scripts. These materials have a limited lifespan, and as generations pass, these materials deteriorate, and the vital knowledge is lost. This work uses digital texts to convey information to future generations. Optical Character Recognition (OCR) helps extract information from scanned manuscripts (printed text). This paper proposes a simple and effective solution of optical character recognition (OCR) Sanskrit Character from text document images using long short-term memory (LSTM) and neural networks of Sanskrit Characters. Existing methods focuses only upon the single touching characters. But our main focus is to design a robust method using Bidirectional Long Short-Term Memory (BLSTM) architecture for overlapping lines, touching characters in middle and upper zone and half character which would increase the accuracy of the present OCR system for recognition of poorly maintained Sanskrit literature.

Download Full-text

Optical character recognition and long short-term memory neural network approach for book classification by librarians

Journal of Physics Conference Series ◽

10.1088/1742-6596/1567/3/032034 ◽

2020 ◽

Vol 1567 ◽

pp. 032034

Author(s):

YD Rosita ◽

YN Sukmaningtyas

Keyword(s):

Neural Network ◽

Character Recognition ◽

Optical Character Recognition ◽

Short Term Memory ◽

Network Approach ◽

Short Term ◽

Neural Network Approach ◽

Term Memory ◽

Optical Character ◽

Long Short Term Memory

Download Full-text

An Improved Scene Text Extraction Method Using Conditional Random Field and Optical Character Recognition

2011 International Conference on Document Analysis and Recognition ◽

10.1109/icdar.2011.148 ◽

2011 ◽

Cited By ~ 20

Author(s):

Hongwei Zhang ◽

Changsong Liu ◽

Cheng Yang ◽

Xiaoqing Ding ◽

KongQiao Wang

Keyword(s):

Random Field ◽

Character Recognition ◽

Optical Character Recognition ◽

Extraction Method ◽

Conditional Random Field ◽

Text Extraction ◽

Optical Character ◽

Scene Text

Download Full-text

Aplikasi Kalkulator Tulisan Tangan Sederhana Menggunakan Optical Character Recognition (OCR)

Applied Technology and Computing Science Journal ◽

10.33086/atcsj.v3i2.1867 ◽

2021 ◽

Vol 3 (2) ◽

pp. 103-116

Author(s):

Supriadi Supriadi

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Text Recognition ◽

Arithmetic Operations ◽

Written Text ◽

Optical Character ◽

Calculation Results

The calculator is a calculation tool that is widely used in various specialized fields of business and commerce. The use of a calculator makes it easier for humans to perform arithmetic operations, but there are obstacles in the process of inputting numbers if you want to calculate the value of numbers on written media such as paper, whiteboards and so on. The user must first see the text on written media, then read it and remember it then type the writing on a calculator tool or application. The drawback of this method is that when the user forgets the writing on the written media, the user will see the written text and remember it again so that it takes longer to perform calculations using a calculator. The method used in this study is Optical Character Recognition, this method can recognize text contained in images or handwritten images of mathematical number operations. The results of the text recognition will then be carried out by arithmetic calculations to get the calculation results. From the trials on 20 handwritten images of mathematical number operations, the results obtained were 85% accuracy of extraction and accuracy of handwritten images that can be calculated and correct by 85%

Download Full-text

Product Label Reading System for Blind People using Support Vector Machine Algorithm

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.f1047.0886s219 ◽

2019 ◽

Vol 8 (6S2) ◽

pp. 179-186

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Region Of Interest ◽

Ground Truth ◽

Support Vector ◽

Svm Classifier ◽

Stroke Width ◽

Optical Character ◽

Character Size ◽

Reading System

Theoretical—This paper shows a camera based assistive content perusing of item marks from articles to support outwardly tested individuals. Camera fills in as fundamental wellspring of info. To recognize the items, the client will move the article before camera and this moving item will be identified by Background Subtraction (BGS) Method. Content district will be naturally confined as Region of Interest (ROI). Content is extricated from ROI by consolidating both guideline based and learning based technique. A tale standard based content limitation calculation is utilized by recognizing geometric highlights like pixel esteem, shading force, character size and so forth and furthermore highlights like Gradient size, slope width and stroke width are found out utilizing SVM classifier and a model is worked to separate content and non-content area. This framework is coordinated with OCR (Optical Character Recognition) to extricate content and the separated content is given as a voice yield to the client. The framework is assessed utilizing ICDAR-2011 dataset which comprise of 509 common scene pictures with ground truth.

Download Full-text

Research on Deep Learning Techniques in Breaking Text-Based Captchas and Designing Image-Based Captcha

International Journal of Advanced Research in Science, Communication and Technology ◽

10.48175/ijarsct-900 ◽

2021 ◽

pp. 266-269

Author(s):

Janarthanan A ◽

Pandiyarajan C ◽

Sabarinathan M ◽

Sudhan M ◽

Kala R

Keyword(s):

Deep Learning ◽

Image Classification ◽

Character Recognition ◽

Optical Character Recognition ◽

Experimental Results ◽

Text Recognition ◽

Image Resizing ◽

Optical Character ◽

Learning Techniques ◽

Text Images

Optical character recognition (OCR) is a process of text recognition in images (one word). The input images are taken from the dataset. The collected text images are implemented to pre-processing. In pre-processing, we can implement the image resize process. Image resizing is necessary when you need to increase or decrease the total number of pixels, whereas remapping can occur when you are zooming refers to increase the quantity of pixels, so that when you zoom an image, you will see clear content. After that, we can implement the segmentation process. In segmentation, we can segment the each characters in one word. We can extract the features values from the image that means test feature. In classification process, we have to classify the text from the image. Image classification is performed the images in order to identify which image contains text. A classifier is used to identify the image containing text. The experimental results shows that the accuracy.

Download Full-text

A Study of Novel Optical Character Recognition Algorithms

Journal of University of Shanghai for Science and Technology ◽

10.51201/jusst/21/05265 ◽

2021 ◽

Vol 23 (06) ◽

pp. 301-305

Author(s):

Roshan Suvaris ◽

◽

Dr. S Sathyanarayana ◽

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Human Life ◽

Handwriting Recognition ◽

Digital Storage ◽

Optical Character ◽

Pros And Cons ◽

Computer Text ◽

Almost All ◽

Textual Content

Optical Character Recognition has been an inseparable part of human life during everyday transactions. The OCR has extended its application areas in almost all fields viz. healthcare, finance, banking, entertainment, trading system, digital storage, and so on. In the recent past, handwriting recognition is one of the hardest study areas in the area of image processing. In this paper, the various techniques for converting textual content from number plates, printed, handwritten paper documents into machine code have been discussed. The transforming method used in all these techniques is known as OCR. The English OCR system is necessary for the conversion of various published books and other documents in English into human editable computer text files. The latest researches in this area have included methodologies that identify different fonts and styles of English handwritten scripts. As of date, even though a number of algorithms are available, it has its own pros and cons. Since the recognition of different styles and fonts in machine-printed and handwritten English script is the biggest challenge, this field is open for researchers to implement new algorithms that would overcome the deficiencies of its predecessors.

Download Full-text