A document binarization method based on connected operators

Image binarization is still a challenging task in a variety of applications. In particular, Document Image Binarization Contest (DIBCO) is organized regularly to track the state-of-the-art techniques for the historical document binarization. In this work we present a binarization method that was ranked first in the DIBCO`17 contest. It is a convolutional neural network (CNN) based method which uses U-Net architecture, originally designed for biomedical image segmentation. We describe our approach to training data preparation and contest ground truth examination and provide multiple insights on its construction (so called hacking). It led to more accurate historical document binarization problem statement with respect to the challenges one could face in the open access datasets. A docker container with the final network along with all the supplementary data we used in the training process has been published on Github.

Download Full-text

An Adaptive Binarization Method for Cost-efficient Document Image System in Wavelet Domain

Journal of Imaging Science and Technology ◽

10.2352/j.imagingsci.technol.2020.64.3.030401 ◽

2020 ◽

Vol 64 (3) ◽

pp. 30401-1-30401-14 ◽

Cited By ~ 1

Author(s):

Chih-Hsien Hsia ◽

Ting-Yu Lin ◽

Jen-Shiun Chiang

Keyword(s):

Wavelet Transform ◽

Discrete Wavelet Transform ◽

Document Image ◽

Background Information ◽

Raspberry Pi ◽

Discrete Wavelet ◽

Image System ◽

Low Frequencies ◽

Cost Efficient ◽

Binarization Method

Abstract In recent years, the preservation of handwritten historical documents and scripts archived by digitized images has been gradually emphasized. However, the selection of different thicknesses of the paper for printing or writing is likely to make the content of the back page seep into the front page. In order to solve this, a cost-efficient document image system is proposed. In this system, the authors use Adaptive Directional Lifting-Based Discrete Wavelet Transform to transform image data from spatial domain to frequency domain and perform on high and low frequencies, respectively. For low frequencies, the authors use local threshold to remove most background information. For high frequencies, they use modified Least Mean Square training algorithm to produce a unique weighted mask and perform convolution on original frequency, respectively. Afterward, Inverse Adaptive Directional Lifting-Based Discrete Wavelet Transform is performed to reconstruct the four subband images to a resulting image with original size. Finally, a global binarization method, Otsu’s method, is applied to transform a gray scale image to a binary image as the output result. The results show that the difference in operation time of this work between a personal computer (PC) and Raspberry Pi is little. Therefore, the proposed cost-efficient document image system which performed on Raspberry Pi embedded platform has the same performance and obtains the same results as those performed on a PC.

Download Full-text

A new hybrid binarization method based on Kmeans

2014 6th International Symposium on Communications, Control and Signal Processing (ISCCSP) ◽

10.1109/isccsp.2014.6877830 ◽

2014 ◽

Cited By ~ 7

Author(s):

Mahmoud Soua ◽

Rostom Kachouri ◽

Mohamed Akil

Keyword(s):

Binarization Method

Download Full-text

Flat zones filtering, connected operators, and filters by reconstruction

IEEE Transactions on Image Processing ◽

10.1109/83.403422 ◽

1995 ◽

Vol 4 (8) ◽

pp. 1153-1160 ◽

Cited By ~ 365

Author(s):

P. Salembier ◽

J. Serra

Keyword(s):

Connected Operators

Download Full-text

Automatic binarization method in ISAR image

2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) ◽

10.1109/igarss.2015.7327060 ◽

2015 ◽

Author(s):

Xiaohui Zhao ◽

Yicheng Jiang ◽

Yun Zhang

Keyword(s):

Binarization Method

Download Full-text

Practical Extensions of Connected Operators

Mathematical Morphology and its Applications to Image and Signal Processing - Computational Imaging and Vision ◽

10.1007/978-1-4613-0469-2_12 ◽

1996 ◽

pp. 97-110 ◽

Cited By ~ 13

Author(s):

P. Salembier ◽

A. Oliveras

Keyword(s):

Connected Operators

Download Full-text

Application of Zernike moments in computer vision problems for infrared images

E3S Web of Conferences ◽

10.1051/e3sconf/202131001002 ◽

2021 ◽

Vol 310 ◽

pp. 01002

Author(s):

Dmitriy Otkupman ◽

Sergey Bezdidko ◽

Victoria Ostashenkova

Keyword(s):

Computer Vision ◽

Machine Vision ◽

Thermal Imaging ◽

Imaging System ◽

Infrared Region ◽

Zernike Moments ◽

Infrared Images ◽

Theoretical Justification ◽

Thermal Imaging System ◽

Binarization Method

The efficiency of using Zernike moments when working with digital images obtained in the infrared region of the spectrum is considered to improve the accuracy and speed of an autonomous thermal imaging system. The theoretical justification of the choice of Zernike moments for solving computer (machine) vision problems and the choice of a suitable threshold binarization method is given. In order to verify the adequacy and expediency of using the chosen method, practical studies were conducted on the use of Zernike methods for distorting various thermal images in shades of gray.

Download Full-text

A Lightweight Dynamic Binarization Method for 2D Barcode Scanning Terminal

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.130-134.4079 ◽

2011 ◽

Vol 130-134 ◽

pp. 4079-4083

Author(s):

Jia Jia Li ◽

Ke Liang Zhang ◽

Gang Wei ◽

Bai Feng Wu

Keyword(s):

Density Ratio ◽

Recognition System ◽

Gray Level ◽

Two Dimensional ◽

2D Barcode ◽

Space Cost ◽

Binarization Method ◽

Local Average ◽

Number Of Iterations ◽

Average Gray Level

It is a difficult task to binarize image under uneven illumination, and this problem is always met in the image recognition system, such as two-dimensional barcode scanning terminal. In this paper, an efficient approach is proposed to binarize image which can tolerant uneven illumination and different light intensity. The method initializes thresholds with local average gray level and adjusts thresholds by calculating light density ratio. Due to characteristic of our approach, it can even obtain a sound result by limiting number of iterations which will seriously reduce computations and space cost. According to experiments, we can find that our method can achieve a good performance and meet the real-time requirement and quality demand for barcode scanning terminal.

Download Full-text