A Fuzzy Matching based Image Classification System for Printed and Handwritten Text Documents

This article proposes a bi-leveled image classification system to classify printed and handwritten English documents into mutually exclusive predefined categories. The proposed system follows the steps of preprocessing, segmentation, feature extraction, and SVM based character classification at level 1, and word association and fuzzy matching based document classification at level 2. The system architecture and its modular structure discuss various task stages and their functionalities. Further, a case study on document classification is discussed to show the internal score computations of words and keywords with fuzzy matching. The experiments on proposed system illustrate that the system achieves promising results in the time-efficient manner and achieves better accuracy with less computation time for printed documents than handwritten ones. Finally, the performance of the proposed system is compared with the existing systems and it is observed that proposed system performs better than many other systems.

Download Full-text

A Hybrid Hindi Printed Document Classification System Using SVM and Fuzzy

Journal of Information Technology Research ◽

10.4018/jitr.2019100106 ◽

2019 ◽

Vol 12 (4) ◽

pp. 107-131 ◽

Cited By ~ 3

Author(s):

Shalini Puri ◽

Satya Prakash Singh

Keyword(s):

Classification System ◽

Classification Accuracy ◽

Document Classification ◽

Fuzzy Matching ◽

Left And Right ◽

Execution Times

This article introduces a new advanced tri-layered segmentation and bi-leveled-classifier-based Hindi printed document classification system, which categorizes imaged documents into pre-defined mutually exclusive categories by using SVM and Fuzzy matching at character and document classifications, respectively. During training, the improved and noise-free image is segmented into lines and words by profiling. Then it obtains Shirorekha Less (SL) isolated characters along with upper, left and right modifier components from the SL words. These components use their locations and inter character-modifier component distance to get associate with their corresponding characters only. Further, confidence values of all characters are calculated with SVM training and all characters are mapped into Romanized labels to generate the words. Finally, documents are classified by Fuzzy based matching of Romanized detected words and predefined classes. The average execution times of SL characters are 0.22675 sec. and 0.20375 sec. and classification accuracy are 74.61% and 80.73% for training and testing, respectively.

Download Full-text

Diagnostic Accuracies of Laryngeal Diseases Using a Convolutional Neural Network‐Based Image Classification System

The Laryngoscope ◽

10.1002/lary.29595 ◽

2021 ◽

Author(s):

Won Ki Cho ◽

Yeong Ju Lee ◽

Hye Ah Joo ◽

In Seong Jeong ◽

Yeonjoo Choi ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Classification ◽

Classification System ◽

Laryngeal Diseases

Download Full-text

Image classification system based on deep learning applied to the recognition of traffic signs for intelligent robotic vehicle navigation purposes

2017 Latin American Robotics Symposium (LARS) and 2017 Brazilian Symposium on Robotics (SBR) ◽

10.1109/sbr-lars-r.2017.8215287 ◽

2017 ◽

Cited By ~ 9

Author(s):

Diego Renan Bruno ◽

Fernando Santos Osorio

Keyword(s):

Deep Learning ◽

Image Classification ◽

Classification System ◽

Vehicle Navigation ◽

Traffic Signs ◽

Robotic Vehicle

Download Full-text

Comment on “The value of a new image classification system for planning treatment and prognosis of spontaneous isolated superior mesenteric artery dissection”

Vascular ◽

10.1177/1708538115598070 ◽

2015 ◽

Vol 23 (5) ◽

pp. 558-558 ◽

Cited By ~ 1

Author(s):

Shaoqin Li ◽

Xiaocheng Gu ◽

Guomin Jiang ◽

Feng Tian

Keyword(s):

Superior Mesenteric Artery ◽

Image Classification ◽

Classification System ◽

Mesenteric Artery ◽

Artery Dissection ◽

Planning Treatment ◽

Superior Mesenteric Artery Dissection

Download Full-text

Line and Word Segmentation of handwritten text documents written in Gurmukhi Script using mid point detection technique

2015 2nd International Conference on Recent Advances in Engineering & Computational Sciences (RAECS) ◽

10.1109/raecs.2015.7453388 ◽

2015 ◽

Cited By ~ 4

Author(s):

Payal Jindal ◽

Balkrishan Jindal

Keyword(s):

Word Segmentation ◽

Detection Technique ◽

Text Documents ◽

Handwritten Text ◽

Point Detection ◽

Gurmukhi Script

Download Full-text

Facial skin image classification system using Convolutional Neural Networks deep learning algorithm

2018 9th International Conference on Awareness Science and Technology (iCAST) ◽

10.1109/icawst.2018.8517246 ◽

2018 ◽

Author(s):

Chiun-Li Chin ◽

Ming-Chieh Chin ◽

Ting-Yu Tsai ◽

Wei-En Chen

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Image Classification ◽

Convolutional Neural Networks ◽

Classification System ◽

Learning Algorithm ◽

Facial Skin ◽

Deep Learning Algorithm

Download Full-text

Use of cortical filters and neural networks in a self-organising image classification system

Image Analysis and Processing - Lecture Notes in Computer Science ◽

10.1007/3-540-60298-4_253 ◽

1995 ◽

pp. 165-170

Author(s):

Nikolay Petkov

Keyword(s):

Neural Networks ◽

Image Classification ◽

Classification System

Download Full-text

A High Performace of Local Binary Pattern on Classify Javanese Character Classification

Scientific Journal of Informatics ◽

10.15294/sji.v5i1.14017 ◽

2018 ◽

Vol 5 (1) ◽

pp. 8 ◽

Cited By ~ 1

Author(s):

Ajib Susanto ◽

Daurat Sinaga ◽

Christy Atika Sari ◽

Eko Hari Rachmawanto ◽

De Rosal Ignatius Moses Setiadi

Keyword(s):

Feature Extraction ◽

Image Classification ◽

Local Binary Pattern ◽

Nearest Neighbor ◽

Classification Algorithm ◽

K Nearest Neighbor ◽

Characteristic Extraction ◽

Research Objects ◽

Character Classification

The classification of Javanese character images is done with the aim of recognizing each character. The selected classification algorithm is K-Nearest Neighbor (KNN) at K = 1, 3, 5, 7, and 9. To improve KNN performance in Javanese character written by the author, and to prove that feature extraction is needed in the process image classification of Javanese character. In this study selected Local Binary Patter (LBP) as a feature extraction because there are research objects with a certain level of slope. The LBP parameters are used between [16 16], [32 32], [64 64], [128 128], and [256 256]. Experiments were performed on 80 training drawings and 40 test images. KNN values after combination with LBP characteristic extraction were 82.5% at K = 3 and LBP parameters [64 64].

Download Full-text

Unconstrained Handwritten Text Line Segmentation for Kannada Language

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.j9624.1081219 ◽

2019 ◽

Vol 8 (12) ◽

pp. 953-956

Keyword(s):

Character Recognition ◽

Recognition System ◽

Text Line ◽

Connected Component ◽

Horizontal Projection ◽

Text Documents ◽

Handwritten Text ◽

Kannada Language ◽

System Separation ◽

Line Segmentation

Segmentation is division of something into smaller parts and one of the Component of character recognition system. Separation of characters, words and lines are done in Segmentation from text documents. character recognition is a process which allows computers to recognize written or printed characters such as numbers or letters and to change them into a form that the computer can use. the accuracy of OCR system is done by taking the output of an OCR run for an image and comparing it to the original version of the same text. The main aim of this paper is to find out the various text line segmentations are Projection profiles, Weighted Bucket Method. Proposed method is horizontal projection profile and connected component method on Handwritten Kannada language. These methods are used for experimentation and finally comparing their accuracy and results.

Download Full-text

Hindi Text Document Classification System Using SVM and Fuzzy

International Journal of Rough Sets and Data Analysis ◽

10.4018/ijrsda.2018100101 ◽

2018 ◽

Vol 5 (4) ◽

pp. 1-31 ◽

Cited By ~ 8

Author(s):

Shalini Puri ◽

Satya Prakash Singh

Keyword(s):

Classification System ◽

Character Recognition ◽

Optical Character Recognition ◽

Document Classification ◽

Data Availability ◽

Support Vector ◽

Handwritten Documents ◽

Text Document ◽

Survey Report ◽

Text Document Classification

In recent years, many information retrieval, character recognition, and feature extraction methodologies in Devanagari and especially in Hindi have been proposed for different domain areas. Due to enormous scanned data availability and to provide an advanced improvement of existing Hindi automated systems beyond optical character recognition, a new idea of Hindi printed and handwritten document classification system using support vector machine and fuzzy logic is introduced. This first pre-processes and then classifies textual imaged documents into predefined categories. With this concept, this article depicts a feasibility study of such systems with the relevance of Hindi, a survey report of statistical measurements of Hindi keywords obtained from different sources, and the inherent challenges found in printed and handwritten documents. The technical reviews are provided and graphically represented to compare many parameters and estimate contents, forms and classifiers used in various existing techniques.

Download Full-text