Improved document skew detection based on text line connected-component clustering

Segmentation is the division of something into smaller parts and is one of the components of a character recognition system. In text documents, segmentation separates characters, words and lines. Character recognition is a process that allows computers to recognize written or printed characters, such as numbers or letters, and to convert them into a form that the computer can use. The accuracy of an OCR system is measured by taking the output of an OCR run for an image and comparing it to the original version of the same text. The main aim of this paper is to examine text line segmentation methods such as projection profiles and the Weighted Bucket Method. The proposed method applies the horizontal projection profile and connected component methods to handwritten Kannada text. These methods are used for experimentation, and their accuracy and results are then compared.
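
The horizontal projection profile mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes a binarized image given as a list of rows of 0/1 values, and the names `horizontal_projection`, `segment_lines` and `min_pixels` are hypothetical. Rows containing ink are grouped into runs, and each run is reported as one text line.

```python
from typing import List, Tuple


def horizontal_projection(image: List[List[int]]) -> List[int]:
    """Count foreground (1) pixels in each row of a binary image."""
    return [sum(row) for row in image]


def segment_lines(image: List[List[int]], min_pixels: int = 1) -> List[Tuple[int, int]]:
    """Return (first_row, last_row) for each run of rows containing ink.

    min_pixels is an assumed threshold: a row belongs to a text line
    when it holds at least this many foreground pixels.
    """
    profile = horizontal_projection(image)
    lines = []
    start = None
    for y, count in enumerate(profile):
        if count >= min_pixels and start is None:
            start = y  # a text line begins
        elif count < min_pixels and start is not None:
            lines.append((start, y - 1))  # the line ended on the previous row
            start = None
    if start is not None:
        lines.append((start, len(profile) - 1))  # line touches the bottom edge
    return lines


page = [
    [0, 0, 0],
    [1, 1, 0],
    [0, 1, 0],
    [0, 0, 0],
    [1, 0, 1],
    [0, 0, 0],
]
print(segment_lines(page))  # [(1, 2), (4, 4)]
```

A profile of this kind only separates lines cleanly when blank rows exist between them, which is one reason it is often paired with connected component analysis for handwritten text where neighbouring lines touch or overlap.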

Combined orientation and skew detection using geometric text-line modeling

International Journal on Document Analysis and Recognition (IJDAR) ◽

10.1007/s10032-009-0109-5 ◽

2010 ◽

Vol 13 (2) ◽

pp. 79-92 ◽

Cited By ~ 18

Author(s):

Joost van Beusekom ◽

Faisal Shafait ◽

Thomas M. Breuel

Keyword(s):

Text Line ◽

Skew Detection

Skew detection and text line position determination in digitized documents

Pattern Recognition ◽

10.1016/s0031-3203(96)00157-4 ◽

1997 ◽

Vol 30 (9) ◽

pp. 1505-1519 ◽

Cited By ~ 66

Author(s):

B. Gatos ◽

N. Papamarkos ◽

C. Chamzas

Keyword(s):

Text Line ◽

Line Position ◽

Position Determination ◽

Skew Detection ◽

Digitized Documents

Text line processing for high-confidence skew detection in image documents

Proceedings of the 2010 IEEE 6th International Conference on Intelligent Computer Communication and Processing ◽

10.1109/iccp.2010.5606448 ◽

2010 ◽

Cited By ~ 3

Author(s):

Daniel Rosner ◽

Costin-Anton Boiangiu ◽

Alexandru Stefanescu ◽

Nicolae Tapus ◽

Alexandra Olteanu

Keyword(s):

Text Line ◽

High Confidence ◽

Skew Detection

Global Skew Detection and Correction of Document Image Based on Least Square Method and Extensive Connected Component Analysis

10.1007/978-981-16-4149-7_38 ◽

2021 ◽

pp. 429-440

Author(s):

Faisal Imran ◽

Md. Ali Hossain ◽

Md. Al Mamun ◽

Bhupesh Kumar Singh ◽

Tanupriya Choudhury

Keyword(s):

Least Square Method ◽

Component Analysis ◽

Document Image ◽

Least Square ◽

Connected Component ◽

Connected Component Analysis ◽

Skew Detection

Touching text line segmentation combined local baseline and connected component for Uchen Tibetan historical documents

Information Processing & Management ◽

10.1016/j.ipm.2021.102689 ◽

2021 ◽

Vol 58 (6) ◽

pp. 102689

Author(s):

Pengfei Hu ◽

Weilan Wang ◽

Qiaoqiao Li ◽

Tiejun Wang

Keyword(s):

Historical Documents ◽

Text Line ◽

Connected Component ◽

Text Line Segmentation ◽

Line Segmentation

Text Line Segmentation for Unconstrained Handwritten Document Images Using Neighborhood Connected Component Analysis

Lecture Notes in Computer Science - Pattern Recognition and Machine Intelligence ◽

10.1007/978-3-642-11164-8_60 ◽

2009 ◽

pp. 369-374 ◽

Cited By ~ 14

Author(s):

Abhishek Khandelwal ◽

Pritha Choudhury ◽

Ram Sarkar ◽

Subhadip Basu ◽

Mita Nasipuri ◽

...

Keyword(s):

Component Analysis ◽

Text Line ◽

Document Images ◽

Connected Component ◽

Connected Component Analysis ◽

Handwritten Document ◽

Text Line Segmentation ◽

Line Segmentation

A Hybrid Approach for Skew Detection and Correction in the Multi-script Scanned Document

Asian Journal of Research in Computer Science ◽

10.9734/ajrcos/2019/v4i230112 ◽

2019 ◽

pp. 1-8

Author(s):

M. Ramanan

Keyword(s):

Hybrid Method ◽

Hough Transform ◽

Character Recognition ◽

Optical Character Recognition ◽

Hybrid Approach ◽

Wiener Filter ◽

Text Line ◽

Skew Angle ◽

Skew Detection ◽

Optical Character

Skew detection and correction of a scanned document is a very important step in Optical Character Recognition because skew in a scanned document reduces recognition accuracy. This paper presents a text line approach for skew detection and correction that calculates the skew angle of multi-script scanned documents using the Radon transform, Hough transform, Harris corner detection, Wiener filter and smearing algorithm. The proposed approach is compared with existing skew detection and correction techniques on printed documents in different scripts: English, Tamil, Sinhala and mixed-script. The proposed hybrid method is tested on 160 documents. The overall test result is 90.62% for skew detection and correction.
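
To make the Hough transform step concrete, the following is a rough sketch of Hough-style skew angle estimation, not the hybrid method from the paper. It assumes the foreground pixels are already available as (x, y) coordinates with y increasing downward, and the function name `estimate_skew`, the angle range `max_angle` and the resolution `step` are hypothetical choices. Each candidate angle receives votes from points that share the same line offset, and the angle whose votes are most concentrated is returned.

```python
import math
from collections import Counter
from typing import Iterable, Tuple


def estimate_skew(points: Iterable[Tuple[int, int]],
                  max_angle: float = 15.0,
                  step: float = 0.5) -> float:
    """Estimate the skew angle in degrees with a restricted Hough vote.

    A text line skewed by angle a satisfies y = x * tan(a) + c, so the
    offset rho = y * cos(a) - x * sin(a) is constant along that line.
    For each candidate angle the offsets are binned to the nearest
    integer; the angle with the most concentrated bins wins.
    """
    pts = list(points)
    best_angle, best_score = 0.0, -1
    n_steps = int(round(2 * max_angle / step))
    for i in range(n_steps + 1):
        angle = -max_angle + i * step
        t = math.radians(angle)
        cos_t, sin_t = math.cos(t), math.sin(t)
        votes = Counter(round(y * cos_t - x * sin_t) for x, y in pts)
        # Sum of squared bin counts rewards a few tall peaks over many small ones.
        score = sum(v * v for v in votes.values())
        if score > best_score:
            best_angle, best_score = angle, score
    return best_angle


# Two synthetic baselines skewed by 5 degrees (assumed test data).
slope = math.tan(math.radians(5.0))
skewed = [(x, round(x * slope) + c) for c in (20, 60) for x in range(200)]
print(estimate_skew(skewed))
```

Restricting the vote to a small range around horizontal keeps the accumulator small, at the cost of missing pages rotated beyond `max_angle`. Correcting the skew would then mean rotating the image by the negative of the returned angle.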

Arabic handwritten text line extraction using connected component analysis from a multi agent perspective

2015 15th International Conference on Intelligent Systems Design and Applications (ISDA) ◽

10.1109/isda.2015.7489204 ◽

2015 ◽

Cited By ~ 4

Author(s):

Youssef Boulid ◽

Abdelghani Souhar ◽

Mohamed Youssfi Elkettani

Keyword(s):

Component Analysis ◽

Text Line ◽

Connected Component ◽

Connected Component Analysis ◽

Line Extraction ◽

Handwritten Text ◽

Multi Agent ◽

Text Line Extraction

A skew detection and correction technique for Arabic script text-line based on subwords bounding

Unconstrained Handwritten Text Line Segmentation for Kannada Language
