Efficient Skew Detection and Correction in Scanned Document Images Through Clustering of Probabilistic Hough Transforms

This article presents an automatic system that takes in grayscale scanned images, which could be mixed text/graphic documents, and performs thresholding and skew detection on the document images. The system consists of two major components; multistage thresholding and skew detection. The proposed skew detection algorithm has no restriction on detectable angle range and does not rely on large blocks of text. It works well on textual document images, graphical images and mixed text and graphic images. The performance of the systems was evaluated using over 60 images that consist of real life documents like envelopes and artificial mixed text/graphic icons. The superior performance of thresholding is clear compared to other techniques from the evaluation. The skew detection algorithm is robust when compared with other methods when very few text lines are present in the document image.

Download Full-text

LANGUAGE INDEPENDENT ROBUST SKEW DETECTION AND CORRECTION TECHNIQUE FOR DOCUMENT IMAGES

International Journal of Electronics Signals and Systems ◽

10.47893/ijess.2012.1077 ◽

2012 ◽

pp. 111-115

Author(s):

Neha. N

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Document Analysis ◽

Document Image ◽

Document Images ◽

Novel Technique ◽

Skew Detection ◽

Optical Character ◽

Correction Technique ◽

All Optical

Document image processing is an increasingly important technology essential in all optical character recognition (OCR) systems and for automation of various office documents. A document originally has zero-skew (tilt), but when a page is scanned or photo copied, skew may be introduced due to various factors and is practically unavoidable. Presence even a small amount of skew (0.50) will have detrimental effects on document analysis as it has a direct effect on the reliability and efficiency of segmentation, recognition and feature extraction stages. Therefore removal of skew is of paramount importance in the field of document analysis and OCR and is the first step to be accomplished. This paper presents a novel technique for skew detection and correction which is both language and content independent. The proposed technique is based on the maximum density of the foreground pixels and their orientation in the document image. Unlike other conventional algorithms which work only for machine printed textual documents scripted in English, this technique works well for all kinds of document images (machine printed, hand written, complex, noisy and simple). The technique presented here is tested with 150 different document image samples and is found to provide results with an accuracy of 0.10

Download Full-text

Improving Skew Detection and Correction in Different Document Images Using a Deep Learning Approach

2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT) ◽

10.1109/icccnt49239.2020.9225619 ◽

2020 ◽

Author(s):

Shaheera Saba Mohd Naseem Akhter ◽

Priti P Rege

Keyword(s):

Deep Learning ◽

Learning Approach ◽

Document Images ◽

Skew Detection

Download Full-text

Skew Detection and Correction Technique for Arabic Document Images Based on Centre of Gravity

Journal of Computer Science ◽

10.3844/jcssp.2009.363.368 ◽

2009 ◽

Vol 5 (5) ◽

pp. 363-368 ◽

Cited By ~ 15

Author(s):

Atallah Mahmoud Al-Shatnaw ◽

Khairuddin Omar

Keyword(s):

Document Images ◽

Centre Of Gravity ◽

Skew Detection ◽

Correction Technique

Download Full-text

Efficient Skew Detection and Correction in Scanned Document Images Through Clustering of Probabilistic Hough Transforms

Efficient skew detection of printed document images based on novel combination of enhanced profiles

Skew Detection of Document Images Using Line Structural Information

Skew detection for binary document images using mathematical morphyology

New Fast Content Based Skew Detection Algorithm for Document Images

A fast orientation and skew detection algorithm for monochromatic document images

A new boundary growing and hough transform based approach for accurate skew detection in binary document images

A ROBUST SYSTEM FOR THRESHOLDING AND SKEW DETECTION IN MIXED TEXT/GRAPHICS DOCUMENTS

LANGUAGE INDEPENDENT ROBUST SKEW DETECTION AND CORRECTION TECHNIQUE FOR DOCUMENT IMAGES

Improving Skew Detection and Correction in Different Document Images Using a Deep Learning Approach

Skew Detection and Correction Technique for Arabic Document Images Based on Centre of Gravity

Export Citation Format