A document binarization method based on connected operators

2010 ◽  
Vol 31 (11) ◽  
pp. 1251-1259 ◽  
Author(s):  
Benoît Naegel ◽  
Laurent Wendling
2019 ◽  
Vol 43 (5) ◽  
pp. 825-832 ◽  
Author(s):  
P.V. Bezmaternykh ◽  
D.A. Ilin ◽  
D.P. Nikolaev

Image binarization is still a challenging task in a variety of applications. In particular, Document Image Binarization Contest (DIBCO) is organized regularly to track the state-of-the-art techniques for the historical document binarization. In this work we present a binarization method that was ranked first in the DIBCO`17 contest. It is a convolutional neural network (CNN) based method which uses U-Net architecture, originally designed for biomedical image segmentation. We describe our approach to training data preparation and contest ground truth examination and provide multiple insights on its construction (so called hacking). It led to more accurate historical document binarization problem statement with respect to the challenges one could face in the open access datasets. A docker container with the final network along with all the supplementary data we used in the training process has been published on Github.


2020 ◽  
Vol 64 (3) ◽  
pp. 30401-1-30401-14 ◽  
Author(s):  
Chih-Hsien Hsia ◽  
Ting-Yu Lin ◽  
Jen-Shiun Chiang

Abstract In recent years, the preservation of handwritten historical documents and scripts archived by digitized images has been gradually emphasized. However, the selection of different thicknesses of the paper for printing or writing is likely to make the content of the back page seep into the front page. In order to solve this, a cost-efficient document image system is proposed. In this system, the authors use Adaptive Directional Lifting-Based Discrete Wavelet Transform to transform image data from spatial domain to frequency domain and perform on high and low frequencies, respectively. For low frequencies, the authors use local threshold to remove most background information. For high frequencies, they use modified Least Mean Square training algorithm to produce a unique weighted mask and perform convolution on original frequency, respectively. Afterward, Inverse Adaptive Directional Lifting-Based Discrete Wavelet Transform is performed to reconstruct the four subband images to a resulting image with original size. Finally, a global binarization method, Otsu’s method, is applied to transform a gray scale image to a binary image as the output result. The results show that the difference in operation time of this work between a personal computer (PC) and Raspberry Pi is little. Therefore, the proposed cost-efficient document image system which performed on Raspberry Pi embedded platform has the same performance and obtains the same results as those performed on a PC.


2021 ◽  
Vol 310 ◽  
pp. 01002
Author(s):  
Dmitriy Otkupman ◽  
Sergey Bezdidko ◽  
Victoria Ostashenkova

The efficiency of using Zernike moments when working with digital images obtained in the infrared region of the spectrum is considered to improve the accuracy and speed of an autonomous thermal imaging system. The theoretical justification of the choice of Zernike moments for solving computer (machine) vision problems and the choice of a suitable threshold binarization method is given. In order to verify the adequacy and expediency of using the chosen method, practical studies were conducted on the use of Zernike methods for distorting various thermal images in shades of gray.


2011 ◽  
Vol 130-134 ◽  
pp. 4079-4083
Author(s):  
Jia Jia Li ◽  
Ke Liang Zhang ◽  
Gang Wei ◽  
Bai Feng Wu

It is a difficult task to binarize image under uneven illumination, and this problem is always met in the image recognition system, such as two-dimensional barcode scanning terminal. In this paper, an efficient approach is proposed to binarize image which can tolerant uneven illumination and different light intensity. The method initializes thresholds with local average gray level and adjusts thresholds by calculating light density ratio. Due to characteristic of our approach, it can even obtain a sound result by limiting number of iterations which will seriously reduce computations and space cost. According to experiments, we can find that our method can achieve a good performance and meet the real-time requirement and quality demand for barcode scanning terminal.


Sign in / Sign up

Export Citation Format

Share Document