degraded document: Recently Published Documents

TOTAL DOCUMENTS: 151 (five years: 38)
H-INDEX: 16 (five years: 3)

Author(s): Jayati Mukherjee, Swapan K. Parui, Utpal Roy

Segmentation of text lines and words in an unconstrained handwritten or machine-printed degraded document is a challenging document analysis problem due to the heterogeneity of the document structure. Often there is uneven skew between the lines, as well as broken words, in a document. The contribution of this article lies in the segmentation of a document page image into lines and words. We propose an unsupervised, robust, and simple statistical method to segment a document image that is either handwritten or machine-printed (degraded or otherwise). In the proposed method, segmentation is treated as a two-class classification problem, where the classification is done by considering the distribution of gap sizes (between lines and between words) in a binary page image. The method is simple and easy to implement: other than binarization of the input image, no pre-processing is necessary, and no high computational resources are needed. The proposed method is unsupervised in the sense that no annotated document page images are necessary, so the issue of a training database does not arise. In fact, given a document page image, the parameters needed for segmentation of text lines and words are learned in an unsupervised manner. We have applied the proposed method to several popular, publicly available handwritten and machine-printed datasets (ISIDDI, IAM-Hist, IAM, PBOK) of different Indian and other languages containing different fonts. Several experimental results are presented to show the effectiveness and robustness of the method. We have also experimented on the ICDAR 2013 handwriting segmentation contest dataset, where our method outperforms the winning method. In addition, we suggest a quantitative measure to compute the level of degradation of a document page image.
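The gap-based two-class idea described above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the run-length extraction and the Otsu-style two-class split are our own simplifications of "classify gap sizes into two classes from their distribution".

```python
import numpy as np

def gap_sizes(profile, empty_level=0):
    """Lengths of consecutive empty runs in a projection profile
    (row sums for line gaps, column sums for word gaps)."""
    empty = np.asarray(profile) <= empty_level
    gaps, run = [], 0
    for e in empty:
        if e:
            run += 1
        elif run:
            gaps.append(run)
            run = 0
    if run:
        gaps.append(run)
    return gaps

def two_class_threshold(values):
    """Unsupervised two-class split of 1-D gap sizes: choose the cut
    that minimises the total within-class variance (Otsu-style)."""
    vals = np.sort(np.asarray(values, dtype=float))
    best_t, best_score = vals[0], np.inf
    for i in range(1, len(vals)):
        left, right = vals[:i], vals[i:]
        score = left.var() * len(left) + right.var() * len(right)
        if score < best_score:
            best_score = score
            best_t = (vals[i - 1] + vals[i]) / 2
    return best_t
```

Gaps larger than the learned threshold would then be treated as inter-word (or inter-line) separators, the smaller ones as intra-word spacing; no annotated data is involved, matching the unsupervised claim above.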


2021, pp. 1-14
Author(s): R.L. Jyothi, M. Abdul Rahiman

Binarization is the most important stage in historical document image processing. The efficient working of character and word recognition algorithms depends on effective segmentation methods, and segmentation algorithms in turn depend on images free of noise and degradation. Most historical documents are illegible, with degradations such as bleed-through, faded ink or faint characters, uneven illumination, and contrast variation. For effective processing of these document images, efficient binarization algorithms should be devised. Here, a simple modified version of a Convolutional Neural Network (CNN) is proposed for historical document binarization. The AOD-Net architecture, which generates dehazed images from hazed images, is modified to create the proposed network. The new CNN model is created by incorporating a Difference of Concatenation (DOC) layer, an Enhancement (EN) layer, and a Thresholding layer into AOD-Net to make it suitable for the binarization of highly degraded document images. The DOC and EN layers work effectively against degradation that exists in the form of low-pass noise. The complexity of the proposed model is reduced by decreasing the number of layers and by introducing filters in the convolution layers that work with low inter-pixel dependency. This modified CNN works effectively on a variety of highly degraded documents when tested on benchmark historical datasets. The main highlight of the proposed network is that it works efficiently, in a generalized manner, on any type of document image without further parameter tuning. Another important highlight is that it can handle most of the degradation categories present in document images. In this work, the performance of the proposed model is compared with Otsu, Sauvola, and three recent Deep Learning-based models.
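For context on the baselines, the Sauvola method the authors compare against computes a per-pixel threshold T = m(1 + k(s/R - 1)) from the local window mean m and standard deviation s. A naive, unoptimized sketch (window size, k, and R are the usual defaults, not values from this paper):

```python
import numpy as np

def sauvola_binarize(img, w=15, k=0.2, R=128.0):
    """Sauvola local thresholding for a grayscale image in [0, 255].
    T(x, y) = m * (1 + k * (s / R - 1)), with m and s taken over a
    w x w window centred on the pixel. Returns True for background."""
    h, wd = img.shape
    pad = w // 2
    padded = np.pad(img.astype(float), pad, mode="edge")
    out = np.zeros((h, wd), dtype=bool)
    for y in range(h):
        for x in range(wd):
            win = padded[y:y + w, x:x + w]
            m, s = win.mean(), win.std()
            out[y, x] = img[y, x] > m * (1 + k * (s / R - 1))
    return out
```

Production implementations compute the window statistics with integral images instead of the per-pixel loop; the per-pixel form above is kept only for readability.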


Author(s): Uche A. Nnolim

Conventional thresholding algorithms have had limited success with degraded document images. Recently, partial differential equations (PDEs) have been applied with good results. However, these are usually tailored to handle relatively few specific distortions. In this study, we combine an edge detection term with a linear binarization source term in a PDE formulation. Additionally, a new proposed diffusivity function further amplifies desired edges. It also suppresses undesired edges that comprise bleed-through effects. Furthermore, we develop the fractional variant of the proposed scheme, which further improves results and provides more flexibility. Moreover, nonlinear color spaces are utilized to improve binarization results for images with color distortion. The proposed scheme removes document image degradation such as bleed-through, stains, smudges, etc., and also restores faded text in the images. Experimental subjective and objective results show consistently superior performance of the proposed approach compared to the state-of-the-art PDE-based models.
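As a toy illustration of the general family of schemes described above (not the paper's actual diffusivity or source term, which are not reproduced here), one can combine a Perona-Malik-style edge-stopping diffusivity with a double-well source that pushes intensities toward 0 or 1:

```python
import numpy as np

def pde_binarize(u, steps=200, dt=0.1, kappa=0.1, lam=1.0):
    """Explicit scheme for u_t = div(c(|grad u|) grad u) + lam * f(u),
    on intensities in [0, 1], with periodic boundaries via np.roll.
    c(g) = exp(-(g/kappa)^2) blocks diffusion across strong edges;
    f(u) = -u(u-1)(u-0.5) is a double-well term driving u to {0, 1}."""
    u = u.astype(float).copy()
    for _ in range(steps):
        gn = np.roll(u, -1, 0) - u  # one-sided differences, 4 directions
        gs = np.roll(u, 1, 0) - u
        ge = np.roll(u, -1, 1) - u
        gw = np.roll(u, 1, 1) - u
        c = lambda g: np.exp(-(g / kappa) ** 2)
        diff = c(gn) * gn + c(gs) * gs + c(ge) * ge + c(gw) * gw
        src = -u * (u - 1.0) * (u - 0.5)
        u += dt * (diff + lam * src)
    return u
```

With a small kappa the diffusivity shuts off at text boundaries while the source term sharpens each region toward pure black or white, which is the qualitative behaviour (edge preservation plus binarization) the abstract describes.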


Author(s): Aniket Pagare

Segmentation of text from badly degraded document images is an extremely difficult task because of the high inter/intra-image variation between the document background and the foreground text of different document images. Image processing and pattern recognition algorithms take a long time to execute on a single-core processor. The Graphics Processing Unit (GPU) is popular nowadays due to its speed, programmability, low cost, and large number of built-in execution cores. The primary objective of this research work is to make binarization faster for the recognition of large numbers of degraded document images on the GPU. In this framework, we propose a new image segmentation algorithm in which every pixel in the image has its own threshold. We perform parallel work on windows of m*n size and separate the object pixels of the text stroke in each window. The document text is further segmented by a local threshold that is estimated based on the intensities of detected text stroke edge pixels within a local window.
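The per-window scheme above is embarrassingly parallel: each m*n window can be thresholded independently, which is what makes it a natural fit for a GPU, where each window would map onto its own thread block. A sequential sketch of the decomposition (the tiled Otsu split here is an illustrative stand-in for the paper's stroke-edge-based local threshold):

```python
import numpy as np

def otsu_threshold(tile):
    """Otsu threshold of one tile by maximising between-class variance.
    Each tile is independent, so on a GPU each call would run in
    its own thread block."""
    hist = np.bincount(tile.ravel(), minlength=256).astype(float)
    total = hist.sum()
    mean_all = (hist * np.arange(256)).sum() / total
    best_t, best_var = 0, -1.0
    cum, cum_mean = 0.0, 0.0
    for t in range(256):
        cum += hist[t]
        cum_mean += t * hist[t]
        if cum == 0 or cum == total:
            continue
        w0 = cum / total
        m0 = cum_mean / cum
        m1 = (mean_all * total - cum_mean) / (total - cum)
        var_between = w0 * (1 - w0) * (m0 - m1) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t

def tiled_binarize(img, tile=32):
    """Split the page into tile x tile windows and threshold each one
    independently, so every pixel gets a locally estimated threshold."""
    out = np.zeros_like(img, dtype=bool)
    for y in range(0, img.shape[0], tile):
        for x in range(0, img.shape[1], tile):
            win = img[y:y + tile, x:x + tile]
            out[y:y + tile, x:x + tile] = win > otsu_threshold(win)
    return out
```

On a GPU the two nested loops disappear: the launch grid enumerates the tiles, and each block computes its window's histogram and threshold in shared memory.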

