scholarly journals Framework of diacritic segmentation for Arabic handwritten document

Author(s):  
Ahmed Abdalla Shiekh ◽  
Mohd Sanusi Azmi ◽  
Maslita Abd Aziz ◽  
Mohammed Nasser Al-Mhiqani ◽  
Salem Saleh Bafjaish

<span lang="EN-US">In <span>recent Arabic standard language and Arabic dialectal texts, diacritics and short vowels are absent. There are some exceptions have been made for the Arabic beginner learner scripts, religious texts and as well as a significant political text. In addition, the text without diacritics is considered ambiguous due to numerous words with different diacritic marks seem identical. However, this paper we present a framework for segmenting diacritics from Arabic handwritten document by using region-based segmentation technique. Since Arabic handwritten and Mushaf Al-Quran contain many diacritical marks. Hence, the diacritics must be properly extracted from Arabic handwritten document to avoid losing some good features. Furthermore, the proposed framework is devised specifically to segment diacritics from Arabic handwritten image, thus there will be no feature extraction, feature selection, and classification processes included. Besides, we will present the methodology that is used to fulfil the objectives of this paper. The pre-processing phases will be explained and more specifically segmentation phase for segmenting diacritics which is the phase we concentrate more in this article. Lastly, we will identify the proposed technique region-based segmentation to facilitate our development throughout the experimental process.</span></span>

2020 ◽  
pp. 17-23
Author(s):  
Neeraj Kumari ◽  
Ashutosh Kumar Bhatt ◽  
Rakesh Kumar Dwivedi ◽  
Rajendra Belwal

Image segmentation is an essential and critical step in huge number of applications of image processing. Accuracy of image segmentation influence retrieved information for further processing in classification and other task. In image segmentation algorithms, a single segmentation technique is not sufficient in providing accurate segmentation results in many cases. In this paper we are proposing a combining approach of image segmentation techniques for improving segmentation accuracy. As a case study fruit mango is selected for classification based on surface defect. This classification method consists of three steps: (a) image pre-processing, (b) feature extraction and feature selection and (c) classification of mango. Feature extraction phase is performed on an enhanced input image. In feature selection PCA methodology is used. In classification three classifiers BPNN, Naïve bayes and LDA are used. Proposed image segmentation technique is tested on online dataset and our own collected images database. Proposed segmentation technique performance is compared with existing segmentation techniques. Classification results of BPNN in training and testing phase are acceptable for proposed segmentation technique.


2012 ◽  
Vol 532-533 ◽  
pp. 1191-1195 ◽  
Author(s):  
Zhen Yan Liu ◽  
Wei Ping Wang ◽  
Yong Wang

This paper introduces the design of a text categorization system based on Support Vector Machine (SVM). It analyzes the high dimensional characteristic of text data, the reason why SVM is suitable for text categorization. According to system data flow this system is constructed. This system consists of three subsystems which are text representation, classifier training and text classification. The core of this system is the classifier training, but text representation directly influences the currency of classifier and the performance of the system. Text feature vector space can be built by different kinds of feature selection and feature extraction methods. No research can indicate which one is the best method, so many feature selection and feature extraction methods are all developed in this system. For a specific classification task every feature selection method and every feature extraction method will be tested, and then a set of the best methods will be adopted.


Sign in / Sign up

Export Citation Format

Share Document