An Adaptive Binarization Method for Cost-efficient Document Image System in Wavelet Domain

2020 ◽  
Vol 64 (3) ◽  
pp. 30401-1-30401-14 ◽  
Author(s):  
Chih-Hsien Hsia ◽  
Ting-Yu Lin ◽  
Jen-Shiun Chiang

Abstract In recent years, the preservation of handwritten historical documents and scripts archived by digitized images has been gradually emphasized. However, the selection of different thicknesses of the paper for printing or writing is likely to make the content of the back page seep into the front page. In order to solve this, a cost-efficient document image system is proposed. In this system, the authors use Adaptive Directional Lifting-Based Discrete Wavelet Transform to transform image data from spatial domain to frequency domain and perform on high and low frequencies, respectively. For low frequencies, the authors use local threshold to remove most background information. For high frequencies, they use modified Least Mean Square training algorithm to produce a unique weighted mask and perform convolution on original frequency, respectively. Afterward, Inverse Adaptive Directional Lifting-Based Discrete Wavelet Transform is performed to reconstruct the four subband images to a resulting image with original size. Finally, a global binarization method, Otsu’s method, is applied to transform a gray scale image to a binary image as the output result. The results show that the difference in operation time of this work between a personal computer (PC) and Raspberry Pi is little. Therefore, the proposed cost-efficient document image system which performed on Raspberry Pi embedded platform has the same performance and obtains the same results as those performed on a PC.

Author(s):  
Nikolajs Bogdanovs ◽  
Elans Grabs ◽  
Ernests Petersons

This article describes real-time discrete wavelet transform algorithm implementation for high-level programming language. The article describes multiscale transform algorithms both for direct discrete wavelet transform and inverse discrete wavelet transform. This algorithm has been implemented in C++ programming language and tested with Raspberry Pi microprocessor system. This article proposes the improved delay line algorithm without full shifting of register. New algorithm requires single reading operation, single writing operation and one division calculation for any length of delay line. The article includes experimental measurements of processing time on Raspberry Pi for various scale numbers. The algorithm described in this article can be used with any software tool capable of using high level programming language, for example Matlab, Octave, Opnet, etc. This is the main purpose – to create algorithm which is not tied strictly to hardware implementation but also, nonetheless, provides real-time discrete wavelet analysis capability.


Author(s):  
Suvit Poomrittigul ◽  
Masahiro Iwahashi

In JPEG2000 (JP2K) color image system, loss less coding signal is not able to be reconstructed with lossy decoder directly. Then, this report proposes a new transcoding between lossless encoder and standard lossy decoder for color image signals base on JP2K. A proposed encoder is required reversible color transform (RCT) and reversible discrete wavelet transform (RDWT) with compatibility to standard lossy decoder based on JP2K (JP2K lossy decoder). To improve the compatibility, proposed encoder is designed by using Non-scaled RCT and Non-Scaled RDWT with embedding scaling parameter into quantization header. Then, this method can be practical use with JP2K lossy decoder without any change. It also reduces total rounding error and lifting steps. The results show that proposed method can keep lossless coding performance and improve transcoding functionality to JP2K lossy decoder. The quality of transcoding image was achieved to 50.05 dB (PSNR).


Informatica ◽  
2013 ◽  
Vol 24 (4) ◽  
pp. 657-675
Author(s):  
Jonas Valantinas ◽  
Deividas Kančelkis ◽  
Rokas Valantinas ◽  
Gintarė Viščiūtė

Sign in / Sign up

Export Citation Format

Share Document