Efficient Image Denoising for Effective Digitization using Image Processing Techniques and Neural Networks

2016 ◽  
Vol 7 (4) ◽  
pp. 77-93 ◽  
Author(s):  
K.G. Srinivasa ◽  
B.J. Sowmya ◽  
D. Pradeep Kumar ◽  
Chetan Shetty

Vast reserves of information are found in ancient texts, scripts, stone tablets, etc. However, because creating new physical copies of such texts is difficult, the knowledge they contain is limited to the few who have access to them. With the advent of Optical Character Recognition (OCR), efforts have been made to digitize such information. Digitization increases its availability by making it easier to share, search, and edit. Many documents are held back because they are damaged. This gives rise to an interesting problem: removing the noise from such documents so that OCR is easier to apply to them. Here the authors aim to develop a model that helps denoise images of such documents while retaining only the text. The primary goal of their project is to ease document digitization. They intend to study the effects of combining image processing techniques and neural networks. Image processing techniques such as thresholding, filtering, edge detection, and morphological operations will be applied to pre-process images so that neural network models achieve higher accuracy.
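
The abstract names filtering and thresholding as pre-processing steps but does not say which variants the authors use. As a minimal sketch only, assuming a 3 x 3 median filter followed by Otsu's global threshold (both swapped in here for illustration, not taken from the paper), the idea could look like this in plain Python. The function names `median_filter`, `otsu_threshold`, and `denoise_and_binarize` are hypothetical, and the image is assumed to be a 2-D list of integer gray levels in 0-255.

```python
from statistics import median


def median_filter(img, k=3):
    """Apply a k x k median filter (k odd) to a 2-D list of gray levels.

    Border pixels are handled by replicating the nearest edge pixel.
    """
    h, w = len(img), len(img[0])
    r = k // 2
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [
                img[min(max(y + dy, 0), h - 1)][min(max(x + dx, 0), w - 1)]
                for dy in range(-r, r + 1)
                for dx in range(-r, r + 1)
            ]
            out[y][x] = int(median(window))
    return out


def otsu_threshold(img):
    """Return the gray level t that maximizes the between-class variance
    of the two classes {v <= t} and {v > t} (Otsu's method)."""
    hist = [0] * 256
    for row in img:
        for v in row:
            hist[v] += 1
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0, sum0 = 0, 0
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue  # no dark pixels yet
        w1 = total - w0
        if w1 == 0:
            break  # every pixel is already in the dark class
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        between_var = w0 * w1 * (mean0 - mean1) ** 2
        if between_var > best_var:
            best_t, best_var = t, between_var
    return best_t


def denoise_and_binarize(img):
    """Median-filter the image, then map dark pixels to 0 and the rest to 255."""
    smoothed = median_filter(img)
    t = otsu_threshold(smoothed)
    return [[0 if v <= t else 255 for v in row] for row in smoothed]
```

Note that a 3 x 3 median filter also erases strokes that are only one pixel wide, which is one reason a real pipeline would likely tune or replace these steps for the documents at hand.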

2018 ◽  
pp. 1091-1108
Author(s):  
K.G. Srinivasa ◽  
B.J. Sowmya ◽  
D. Pradeep Kumar ◽  
Chetan Shetty

Vast reserves of information are found in ancient texts, scripts, stone tablets, etc. However, because creating new physical copies of such texts is difficult, the knowledge they contain is limited to the few who have access to them. With the advent of Optical Character Recognition (OCR), efforts have been made to digitize such information. Digitization increases its availability by making it easier to share, search, and edit. Many documents are held back because they are damaged. This gives rise to an interesting problem: removing the noise from such documents so that OCR is easier to apply to them. Here the authors aim to develop a model that helps denoise images of such documents while retaining only the text. The primary goal of their project is to ease document digitization. They intend to study the effects of combining image processing techniques and neural networks. Image processing techniques such as thresholding, filtering, edge detection, and morphological operations will be applied to pre-process images so that neural network models achieve higher accuracy.


Author(s):  
Abhishek Das ◽  
Mihir Narayan Mohanty

In this chapter, the authors give a detailed review of optical character recognition. Various methods are used in this field, with different accuracy levels. Recognizing handwritten characters remains difficult because writing styles differ between individuals, even within a single language. A comparative study is given to explain the different types of optical character recognition and the methods used in each type. Neural networks in various forms appear in most of the works. Recent research applies OCR with CNNs, RNNs, and combinations of CNNs and RNNs.


2013 ◽  
Vol 764 ◽  
pp. 161-164
Author(s):  
Wei Jiang

A back-propagation (BP) neural network is presented for billet character recognition. First, a series of image processing techniques is used to extract the character features from the billet character region of the video image captured by a frame grabber. Second, the BP neural network algorithm is employed for character recognition. Application results show that image recognition based on BP neural networks performs well in billet character recognition, and that the presented method is fast, efficient, and of high practical value.


Author(s):  
Farhana Ahmad Poad ◽  
Noor Shuraya Othman ◽  
Roshayati Yahya Atan ◽  
Jusrorizal Fadly Jusoh ◽  
Mumtaz Anwar Hussin

The aim of this project is to design an Automated Detection of License Plate (ADLP) system based on image processing techniques. Two techniques are commonly used to detect the target: Optical Character Recognition (OCR) and split-and-merge segmentation. The OCR technique operates on the individual alphanumeric characters of the license plate, while the split-and-merge segmentation technique splits the image of the captured plate into regions of interest. Both techniques are implemented in MATLAB, their detection performance is tested on images, and the two are compared. The results show that both techniques perform well on license plates, with some error.


2020 ◽  
Vol 32 (2) ◽  
Author(s):  
Gideon Jozua Kotzé ◽  
Friedel Wolff

As more natural language processing (NLP) applications benefit from neural-network-based approaches, it makes sense to re-evaluate existing work in NLP. A complete pipeline for digitisation includes several components that handle the material in sequence. Image processing after scanning the document has been shown to be an important factor in final quality. Here we compare two different approaches for visually enhancing documents before Optical Character Recognition (OCR): (1) a combination of ImageMagick and Unpaper and (2) OCRopus. We also compare Calamari, a new line-based OCR package using neural networks, with the well-known Tesseract 3 as the OCR component. Our evaluation on a set of Setswana documents reveals that the combination of ImageMagick/Unpaper and Calamari improves on a current baseline based on Tesseract 3 and ImageMagick/Unpaper by over 30%, achieving a mean character error rate of 1.69 across all combined test data.
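
The abstract reports a character error rate without spelling out how it is computed. A common definition, assumed here rather than taken from the paper, is the Levenshtein edit distance between the OCR output and the reference transcription, divided by the length of the reference. The sketch below follows that assumption; the function names `levenshtein` and `character_error_rate` are hypothetical.

```python
def levenshtein(ref, hyp):
    """Return the minimum number of single-character insertions, deletions,
    and substitutions needed to turn `ref` into `hyp`."""
    prev = list(range(len(hyp) + 1))  # distances from the empty prefix of ref
    for i, ref_char in enumerate(ref, 1):
        curr = [i]  # deleting the first i characters of ref
        for j, hyp_char in enumerate(hyp, 1):
            cost = 0 if ref_char == hyp_char else 1
            curr.append(min(
                prev[j] + 1,         # deletion from ref
                curr[j - 1] + 1,     # insertion into ref
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def character_error_rate(ref, hyp):
    """Edit distance normalized by reference length (assumed CER definition)."""
    if not ref:
        raise ValueError("reference text must not be empty")
    return levenshtein(ref, hyp) / len(ref)
```

Under this definition, `character_error_rate("abcd", "abed")` is 0.25: one substitution over four reference characters. Multiplying by 100 gives a percentage, and a mean over a test set can be taken per line or over all characters pooled; the abstract does not say which convention it uses.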

