scholarly journals Degraded Historical Document Binarization: A Review on Issues, Challenges, Techniques, and Future Directions

2019 ◽  
Vol 5 (4) ◽  
pp. 48 ◽  
Author(s):  
Sulaiman ◽  
Omar ◽  
Nasrudin

In this era of digitization, most hardcopy documents are being transformed into digital formats. In the process of transformation, large quantities of documents are stored and preserved through electronic scanning. These documents are available from various sources such as ancient documentation, old legal records, medical reports, music scores, palm leaf, and reports on security-related issues. In particular, ancient and historical documents are hard to read due to their degradation in terms of low contrast and existence of corrupted artefacts. In recent times, degraded document binarization has been studied widely and several approaches were developed to deal with issues and challenges in document binarization. In this paper, a comprehensive review is conducted on the issues and challenges faced during the image binarization process, followed by insights on various methods used for image binarization. This paper also discusses the advanced methods used for the enhancement of degraded documents that improves the quality of documents during the binarization process. Further discussions are made on the effectiveness and robustness of existing methods, and there is still a scope to develop a hybrid approach that can deal with degraded document binarization more effectively.

Author(s):  
Rajan Goyal ◽  
Amandeep Kaur

This paper presents a new hybrid approach for the binarization and enhancement of Historical Manuscript. This paper deals with degradations which occur due to shadows, non-uniform illumination, low contrast and strain. We follow two distinct method of Binarization with a pre-processing procedure using a adaptive Wiener filter, a rough estimation of foreground regions and a background surface calculation by interpolating neighboring background intensities. Further logical anding of the calculated background surface with compliment of second method result, performing final thresholding and post-processing in order to improve the quality of text regions. After extensive experiments, our method demonstrated superior performance against some wellknown techniques on numerous degraded document images as well as on Historical Manuscript in both manners qualitatively and quantitatively.


2021 ◽  
pp. 1-14
Author(s):  
R.L. Jyothi ◽  
M. Abdul Rahiman

Binarization is the most important stage in historical document image processing. Efficient working of character and word recognition algorithms depend on effective segmentation methods. Segmentation algorithms in turn depend on images free of noises and degradations. Most of these historical documents are illegible with degradations like bleeding through degradation, faded ink or faint characters, uneven illumination, contrast variation, etc. For effective processing of these document images, efficient binarization algorithms should be devised. Here a simple modified version of the Convolutional Neural Network (CNN) is proposed for historical document binarization. AOD-Net architecture for generating dehazed images from hazed images is modified to create the proposed network.The new CNN model is created by incorporating Difference of Concatenation layer (DOC), Enhancement layer (EN) and Thresholding layer into AOD-Net to make it suitable for binarization of highly degraded document images. The DOC layer and EN layer work effectively in solving degradation that exists in the form of low pass noises. The complexity of working of the proposed model is reduced by decreasing the number of layers and by introducing filters in convolution layers that work with low inter-pixel dependency. This modified version of CNN works effectively with a variety of highly degraded documents when tested with the benchmark historical datasets. The main highlight of the proposed network is that it works efficiently in a generalized manner for any type of document images without further parameter tuning. Another important highlight of this method is that it can handle most of the degradation categories present in document images. In this work, the performance of the proposed model is compared with Otsu, Sauvola, and three recent Deep Learning-based models.


Author(s):  
Omar Boudraa ◽  
Walid Khaled Hidouci ◽  
Dominique Michelucci

Segmentation is one of the critical steps in historical document image analysis systems that determines the quality of the search, understanding, recognition and interpretation processes. It allows isolating the objects to be considered and separating the regions of interest (paragraphs, lines, words and characters) from other entities (figures, graphs, tables, etc.). This stage follows the thresholding, which aims to improve the quality of the document and to extract its background from its foreground, also for detecting and correcting the skew that leads to redress the document. Here, a hybrid method is proposed in order to locate words and characters in both handwritten and printed documents. Numerical results prove the robustness and the high precision of our approach applied on old degraded document images over four common datasets, in which the pair (Recall, Precision) reaches approximately 97.7% and 97.9%.


2019 ◽  
Vol 43 (5) ◽  
pp. 825-832 ◽  
Author(s):  
P.V. Bezmaternykh ◽  
D.A. Ilin ◽  
D.P. Nikolaev

Image binarization is still a challenging task in a variety of applications. In particular, Document Image Binarization Contest (DIBCO) is organized regularly to track the state-of-the-art techniques for the historical document binarization. In this work we present a binarization method that was ranked first in the DIBCO`17 contest. It is a convolutional neural network (CNN) based method which uses U-Net architecture, originally designed for biomedical image segmentation. We describe our approach to training data preparation and contest ground truth examination and provide multiple insights on its construction (so called hacking). It led to more accurate historical document binarization problem statement with respect to the challenges one could face in the open access datasets. A docker container with the final network along with all the supplementary data we used in the training process has been published on Github.


Author(s):  
Russell L. Steere ◽  
Eric F. Erbe ◽  
J. Michael Moseley

We have designed and built an electronic device which compares the resistance of a defined area of vacuum evaporated material with a variable resistor. When the two resistances are matched, the device automatically disconnects the primary side of the substrate transformer and stops further evaporation.This approach to controlled evaporation in conjunction with the modified guns and evaporation source permits reliably reproducible multiple Pt shadow films from a single Pt wrapped carbon point source. The reproducibility from consecutive C point sources is also reliable. Furthermore, the device we have developed permits us to select a predetermined resistance so that low contrast high-resolution shadows, heavy high contrast shadows, or any grade in between can be selected at will. The reproducibility and quality of results are demonstrated in Figures 1-4 which represent evaporations at various settings of the variable resistor.


1997 ◽  
Vol 4 (5) ◽  
pp. 407-412
Author(s):  
Donna Corwin Moss

Background Support groups help their participants to cope with the emotional and practical impact of their illnesses. Methods The effectiveness of the Leukemia Society of America support groups in enhancing the quality of life for their participants is reviewed. The groundwork, purpose, and structure of such groups, as well as alternate sources of support, are presented. Evaluation and future directions for oncology groupwork are discussed. Results Support groups complement the therapies provided by clinical practitioners and scientists by addressing the additional needs of cancer patients over the course of illness and survival. Conclusions New concepts and methods that address the needs of specific age-groups and incorporate the newly generated data on cancer treatments will further enhance the benefits provided by support groups.


i-Perception ◽  
2021 ◽  
Vol 12 (3) ◽  
pp. 204166952110203
Author(s):  
Jonas K. Olofsson ◽  
Ingrid Ekström ◽  
Maria Larsson ◽  
Steven Nordin

Olfaction, the sense of smell, is characterized by a notable age-dependency such that aging individuals are more likely to have poor olfactory abilities. These impairments are considered to be mostly irreversible and as having potentially profound effects on quality of life and food behavior, as well as constituting warning signs of mortality, cognitive dysfunction, and dementia. Here, we review the current state of research on aging and olfaction, focusing on five topics which we regard to be of particular relevance for the field: nutrition and health, cognition and dementia, mortality, environment and genetics, and training-based enhancement. Under each of these headlines, we provide a state-of-the-art overview and discuss gaps in our knowledge which might be filled by further research. Understanding how olfactory abilities are diminished in aging, and how they may be alleviated or recovered, involves a set of challenging tasks for researchers in the years to come.


2021 ◽  
Vol 10 (14) ◽  
pp. 3012
Author(s):  
Sandra Giménez ◽  
Miren Altuna ◽  
Esther Blessing ◽  
Ricardo M. Osorio ◽  
Juan Fortea

Sleep disorders, despite being very frequent in adults with Down syndrome (DS), are often overlooked due to a lack of awareness by families and physicians and the absence of specific clinical sleep guidelines. Untreated sleep disorders have a negative impact on physical and mental health, behavior, and cognitive performance. Growing evidence suggests that sleep disruption may also accelerate the progression to symptomatic Alzheimer’s disease (AD) in this population. It is therefore imperative to have a better understanding of the sleep disorders associated with DS in order to treat them, and in doing so, improve cognition and quality of life, and prevent related comorbidities. This paper reviews the current knowledge of the main sleep disorders in adults with DS, including evaluation and management. It highlights the existing gaps in knowledge and discusses future directions to achieve earlier diagnosis and better treatment of sleep disorders most frequently found in this population.


2021 ◽  
Vol 54 (7) ◽  
pp. 1-37
Author(s):  
Jihyeok Park ◽  
Hongki Lee ◽  
Sukyoung Ryu

Understanding program behaviors is important to verify program properties or to optimize programs. Static analysis is a widely used technique to approximate program behaviors via abstract interpretation. To evaluate the quality of static analysis, researchers have used three metrics: performance, precision, and soundness. The static analysis quality depends on the analysis techniques used, but the best combination of such techniques may be different for different programs. To find the best combination of analysis techniques for specific programs, recent work has proposed parametric static analysis . It considers static analysis as black-box parameterized by analysis parameters , which are techniques that may be configured without analysis details. We formally define the parametric static analysis, and we survey analysis parameters and their parameter selection in the literature. We also discuss open challenges and future directions of the parametric static analysis.


Sign in / Sign up

Export Citation Format

Share Document