Document image binarization using difference of concatenated convolutions

Low Pass ◽

Degraded Document ◽

Document Binarization

Binarization is the most important stage in historical document image processing. Efficient working of character and word recognition algorithms depend on effective segmentation methods. Segmentation algorithms in turn depend on images free of noises and degradations. Most of these historical documents are illegible with degradations like bleeding through degradation, faded ink or faint characters, uneven illumination, contrast variation, etc. For effective processing of these document images, efficient binarization algorithms should be devised. Here a simple modified version of the Convolutional Neural Network (CNN) is proposed for historical document binarization. AOD-Net architecture for generating dehazed images from hazed images is modified to create the proposed network.The new CNN model is created by incorporating Difference of Concatenation layer (DOC), Enhancement layer (EN) and Thresholding layer into AOD-Net to make it suitable for binarization of highly degraded document images. The DOC layer and EN layer work effectively in solving degradation that exists in the form of low pass noises. The complexity of working of the proposed model is reduced by decreasing the number of layers and by introducing filters in convolution layers that work with low inter-pixel dependency. This modified version of CNN works effectively with a variety of highly degraded documents when tested with the benchmark historical datasets. The main highlight of the proposed network is that it works efficiently in a generalized manner for any type of document images without further parameter tuning. Another important highlight of this method is that it can handle most of the degradation categories present in document images. In this work, the performance of the proposed model is compared with Otsu, Sauvola, and three recent Deep Learning-based models.

Simple and Efficient Document Image Binarization Technique For Degraded Document Images

International Journal of Scientific Research ◽

10.15373/22778179/may2014/65 ◽

2012 ◽

Vol 3 (5) ◽

pp. 217-220

Author(s):

Manju Joseph ◽

◽

Jijina K.P Jijina K.P

Keyword(s):

Document Image ◽

Document Images ◽

Image Binarization ◽

Document Image Binarization Technique for Degraded Document Images

International Journal of Computer Applications ◽

10.5120/21858-5183 ◽

2015 ◽

Vol 122 (22) ◽

pp. 22-29

Author(s):

Supriya Lokhande ◽

N.A.Dawande N.A.Dawande

Keyword(s):

Document Image ◽

Document Images ◽

Image Binarization ◽

Mobile Application for Recognizing Text in Degraded Document Images Using Optical Character Recognition with Adaptive Document Image Binarization

Journal of Image and Graphics ◽

10.18178/joig.6.1.44-47 ◽

2018 ◽

Vol 6 (1) ◽

pp. 44-47

Author(s):

Angie M. Ceniza ◽

◽

Tom Kalvin B. Archival ◽

Kate V. Bongo

Keyword(s):

Mobile Application ◽

Character Recognition ◽

Optical Character Recognition ◽

Document Image ◽

Document Images ◽

Image Binarization ◽

Optical Character ◽

An Efficient Cooperative Smearing Technique for Degraded Historical Document Image Segmentation

International Journal of Image and Graphics ◽

10.1142/s0219467821500121 ◽

2020 ◽

pp. 2150012

Author(s):

Omar Boudraa ◽

Walid Khaled Hidouci ◽

Dominique Michelucci

Keyword(s):

Image Analysis ◽

Image Segmentation ◽

Document Image ◽

Regions Of Interest ◽

Document Images ◽

Historical Document ◽

Document Image Analysis ◽

Degraded Document ◽

Document Image Segmentation

Segmentation is one of the critical steps in historical document image analysis systems that determines the quality of the search, understanding, recognition and interpretation processes. It allows isolating the objects to be considered and separating the regions of interest (paragraphs, lines, words and characters) from other entities (figures, graphs, tables, etc.). This stage follows the thresholding, which aims to improve the quality of the document and to extract its background from its foreground, also for detecting and correcting the skew that leads to redress the document. Here, a hybrid method is proposed in order to locate words and characters in both handwritten and printed documents. Numerical results prove the robustness and the high precision of our approach applied on old degraded document images over four common datasets, in which the pair (Recall, Precision) reaches approximately 97.7% and 97.9%.

U-Net-bin: hacking the document image binarization contest

Computer Optics ◽

10.18287/2412-6179-2019-43-5-825-832 ◽

2019 ◽

Vol 43 (5) ◽

pp. 825-832 ◽

Cited By ~ 2

Author(s):

P.V. Bezmaternykh ◽

D.A. Ilin ◽

D.P. Nikolaev

Keyword(s):

Ground Truth ◽

Document Image ◽

Training Data ◽

Data Preparation ◽

Historical Document ◽

Image Binarization ◽

Biomedical Image ◽

Binarization Method ◽

Document Binarization

Image binarization is still a challenging task in a variety of applications. In particular, Document Image Binarization Contest (DIBCO) is organized regularly to track the state-of-the-art techniques for the historical document binarization. In this work we present a binarization method that was ranked first in the DIBCO`17 contest. It is a convolutional neural network (CNN) based method which uses U-Net architecture, originally designed for biomedical image segmentation. We describe our approach to training data preparation and contest ground truth examination and provide multiple insights on its construction (so called hacking). It led to more accurate historical document binarization problem statement with respect to the challenges one could face in the open access datasets. A docker container with the final network along with all the supplementary data we used in the training process has been published on Github.

Robust Document Image Binarization Technique for Degraded Document Images

IEEE Transactions on Image Processing ◽

10.1109/tip.2012.2231089 ◽

2013 ◽

Vol 22 (4) ◽

pp. 1408-1417 ◽

Cited By ~ 113

Author(s):

Bolan Su ◽

Shijian Lu ◽

Chew Lim Tan

Keyword(s):

Document Image ◽

Document Images ◽

Image Binarization ◽

Document Image Binarization using Image Segmentation Technique

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.36597 ◽

2021 ◽

Vol 9 (VII) ◽

pp. 1173-1176

Author(s):

Mr. Aniket Pagare

Keyword(s):

Image Segmentation ◽

Primary Objective ◽

Document Image ◽

Document Images ◽

Image Binarization ◽

Huge Number ◽

Processor Designs ◽

Equal Work ◽

Segmentation of text from badly degraded document images is an extremely difficult assignment because of the high inter/Intra variety between the record foundation and the frontal area text of various report pictures. Picture preparing and design acknowledgment algorithms set aside more effort for execution on a solitary center processor. Designs Preparing Unit (GPU) is more mainstream these days because of its speed, programmability, minimal expense and more inbuilt execution centers in it. The primary objective of this exploration work is to make binarization quicker for acknowledgment of a huge number of corrupted report pictures on GPU. In this framework, we give another picture division calculation that every pixel in the picture has its own limit proposed. We are accomplishing equal work on a window of m*n size and separate article pixel of text stroke of that window. The archive text is additionally sectioned by a nearby edge that is assessed dependent on the forces of identified content stroke edge pixels inside a nearby window.

An enhanced binarization framework for degraded historical document images

EURASIP Journal on Image and Video Processing ◽

10.1186/s13640-021-00556-4 ◽

2021 ◽

Vol 2021 (1) ◽

Author(s):

Wei Xiong ◽

Lei Zhou ◽

Ling Yue ◽

Lirong Li ◽

Song Wang

Keyword(s):

Document Image ◽

Morphological Operations ◽

Document Images ◽

Minimum Entropy ◽

Stroke Width ◽

Background Estimation ◽

Structuring Element ◽

Benchmark Datasets ◽

Stroke Width Transform

AbstractBinarization plays an important role in document analysis and recognition (DAR) systems. In this paper, we present our winning algorithm in ICFHR 2018 competition on handwritten document image binarization (H-DIBCO 2018), which is based on background estimation and energy minimization. First, we adopt mathematical morphological operations to estimate and compensate the document background. It uses a disk-shaped structuring element, whose radius is computed by the minimum entropy-based stroke width transform (SWT). Second, we perform Laplacian energy-based segmentation on the compensated document images. Finally, we implement post-processing to preserve text stroke connectivity and eliminate isolated noise. Experimental results indicate that the proposed method outperforms other state-of-the-art techniques on several public available benchmark datasets.