TWO-PASS SEGMENTATION FOR MARATHI CHARACTER
EXTRACTION

This paper describes methods of image analysis for historical Japanese book archives with a dominant focus on character segmentation. The segmentation methodology includes stain and smear removal, binarization, character line extraction, and character extraction by region labeling with integration and separation techniques. The experimental results show that the proposed method can segment all text lines correctly and can extract more than 79% of the characters from 16 pages of Chinsetsu Yumiharizuki, containing 176 text lines and a total of 5181 quite complicated characters.

Download Full-text

Robust Deformable Matching for Character Extraction

Series in Machine Perception and Artificial Intelligence - Advances in Handwriting Recognition ◽

10.1142/9789812797650_0023 ◽

1999 ◽

pp. 235-244 ◽

Cited By ~ 2

Author(s):

Kwok-Wai Cheung ◽

Dit-Yan Yeung ◽

Roland T. Chin

Keyword(s):

Character Extraction

Download Full-text

MULTISCALE CHARACTER EXTRACTION OF LFM SIGNAL

Wavelet Analysis and Active Media Technology ◽

10.1142/9789812701695_0207 ◽

2005 ◽

pp. 1341-1345

Author(s):

JIAN KANG ◽

GUOSHENG RUI

Keyword(s):

Character Extraction

Download Full-text

ViOC-optical Alphanumeric Character Extraction from Video Frames

Research Journal of Applied Sciences Engineering and Technology ◽

10.19026/rjaset.8.991 ◽

2014 ◽

Vol 8 (3) ◽

pp. 439-442

Author(s):

Resmi R. Nair ◽

A. Shobana ◽

T. Abhinaya ◽

S. Sibi Chakkaravarthy

Keyword(s):

Alphanumeric Character ◽

Video Frames ◽

Character Extraction

Download Full-text

Robust Text Line, Word And Character Extraction from Telugu Document Image

2009 Second International Conference on Emerging Trends in Engineering & Technology ◽

10.1109/icetet.2009.196 ◽

2009 ◽

Cited By ~ 7

Author(s):

Vijaya Kumar Koppula ◽

Negi Atul ◽

Utpal Garain

Keyword(s):

Document Image ◽

Text Line ◽

Character Extraction

Download Full-text

Character extraction by integrating color into edge-based methods

2015 14th IAPR International Conference on Machine Vision Applications (MVA) ◽

10.1109/mva.2015.7153136 ◽

2015 ◽

Author(s):

Naoki Chiba ◽

Xinhao Liu

Keyword(s):

Character Extraction ◽

Edge Based

Download Full-text

Character extraction from natural scene images by hierarchical classifiers

Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004. ◽

10.1109/icpr.2004.1334352 ◽

2004 ◽

Cited By ~ 3

Author(s):

T. Yamguchi ◽

M. Maruyama

Keyword(s):

Natural Scene ◽

Character Extraction ◽

Natural Scene Images

Download Full-text

A Robust Segmentation Technique for Line, Word and Character Extraction from Kannada Text in Low Resolution Display Board Images

International Journal of Image and Graphics ◽

10.1142/s021946781450003x ◽

2014 ◽

Vol 14 (01n02) ◽

pp. 1450003 ◽

Cited By ~ 2

Author(s):

S. A. Angadi ◽

M. M. Kodabagi

Keyword(s):

Extraction Process ◽

Word Segmentation ◽

Text Line ◽

Low Resolution ◽

Data Set ◽

Display Board ◽

Robust Segmentation ◽

Character Extraction ◽

Segmentation Accuracy ◽

Line Segmentation

Reliable extraction/segmentation of text lines, words and characters is one of the very important steps for development of automated systems for understanding the text in low resolution display board images. In this paper, a new approach for segmentation of text lines, words and characters from Kannada text in low resolution display board images is presented. The proposed method uses projection profile features and on pixel distribution statistics for segmentation of text lines. The method also detects text lines containing consonant modifiers and merges them with corresponding text lines, and efficiently separates overlapped text lines as well. The character extraction process computes character boundaries using vertical profile features for extracting character images from every text line. Further, the word segmentation process uses k-means clustering to group inter character gaps into character and word cluster spaces, which are used to compute thresholds for extracting words. The method also takes care of variations in character and word gaps. The proposed methodology is evaluated on a data set of 1008 low resolution images of display boards containing Kannada text captured from 2 mega pixel cameras on mobile phones at various sizes 240 × 320, 480 × 640 and 960 × 1280. The method achieves text line segmentation accuracy of 97.17%, word segmentation accuracy of 97.54% and character extraction accuracy of 99.09%. The proposed method is tolerant to font variability, spacing variations between characters and words, absence of free segmentation path due to consonant and vowel modifiers, noise and other degradations. The experimentation with images containing overlapped text lines has given promising results.

Download Full-text

TWO-PASS SEGMENTATION FOR MARATHI CHARACTER EXTRACTION

Stroke-model-based character extraction from gray-level document images

Character Extraction and Recognition For Myanmar Script Signboard Images using Block based Pixel Count and Chain Codes

Image Analysis for Historical Japanese Book Archives

Robust Deformable Matching for Character Extraction

MULTISCALE CHARACTER EXTRACTION OF LFM SIGNAL

ViOC-optical Alphanumeric Character Extraction from Video Frames

Robust Text Line, Word And Character Extraction from Telugu Document Image

Character extraction by integrating color into edge-based methods

Character extraction from natural scene images by hierarchical classifiers

A Robust Segmentation Technique for Line, Word and Character Extraction from Kannada Text in Low Resolution Display Board Images