Lexical classes based stop words categorization for Gujarati language

2018 ◽

Vol 6 (6) ◽

pp. 307

Author(s):

Manish M. Kayasth ◽

Bharat C. Patel

Keyword(s):

Feature Extraction ◽

Character Recognition ◽

Recognition Rate ◽

Recognition System ◽

Post Processing ◽

Classification Technique ◽

Scanned Image ◽

Gujarati Language ◽

High Degree ◽

Selection Of

The entire character recognition system is logically characterized into different sections like Scanning, Pre-processing, Classification, Processing, and Post-processing. In the targeted system, the scanned image is first passed through pre-processing modules then feature extraction, classification in order to achieve a high recognition rate. This paper describes mainly on Feature extraction and Classification technique. These are the methodologies which play an important role to identify offline handwritten characters specifically in Gujarati language. Feature extraction provides methods with the help of which characters can identify uniquely and with high degree of accuracy. Feature extraction helps to find the shape contained in the pattern. Several techniques are available for feature extraction and classification, however the selection of an appropriate technique based on its input decides the degree of accuracy of recognition.

Download Full-text

Automatic Speech Recognition of Continuous Speech Signal of Gujarati Language Using Machine Learning

Advances in Intelligent Systems and Computing - Mathematical Modeling, Computational Intelligence Techniques and Renewable Energy ◽

10.1007/978-981-15-9953-8_13 ◽

2021 ◽

pp. 147-159

Author(s):

Purnima Pandit ◽

Priyank Makwana ◽

Shardav Bhatt

Keyword(s):

Machine Learning ◽

Speech Recognition ◽

Automatic Speech Recognition ◽

Speech Signal ◽

Continuous Speech ◽

Gujarati Language

Download Full-text

Online Handwritten Gujarati Word Recognition

International Journal of Computer Vision and Image Processing ◽

10.4018/ijcvip.2019010103 ◽

2019 ◽

Vol 9 (1) ◽

pp. 35-50 ◽

Cited By ~ 1

Author(s):

Vishal A. Naik ◽

Apurva A. Desai

Keyword(s):

Word Recognition ◽

Recognition System ◽

Support Vector ◽

Post Processing ◽

Mapping Rule ◽

Hybrid Features ◽

Rule Based ◽

Chain Code ◽

Handwritten Word Recognition ◽

Gujarati Language

In this article, an online handwritten word recognition system for the Gujarati language is presented by combining strokes, characters, punctuation marks, and diacritics. The authors have used a support vector machine classification algorithm with a radial basis function kernel. The authors used a hybrid features set. The hybrid feature set consists of directional features with curvature data. The authors have used a normalized chain code and zoning-based chain code features. Words are a combination of characters and diacritics. Recognized strokes require post-processing to form a word. The authors have used location-based and mapping rule-based post-processing methods. The authors have achieved an accuracy of 95.3% for individual characters, 91.5% for individual words, and 83.3% for sentences. The average processing time for individual characters is 0.071 seconds.

Download Full-text

Hybrid Chunker for Gujarati Language

Networking Communication and Data Knowledge Engineering - Lecture Notes on Data Engineering and Communications Technologies ◽

10.1007/978-981-10-4585-1_18 ◽

2017 ◽

pp. 217-226

Author(s):

Parneet Kaur ◽

Vishal Goyal ◽

Kritida Shrenik Shah ◽

Umrinderpal Singh

Keyword(s):

Gujarati Language

Download Full-text

Classification of phonemes using modulation spectrogram based features for Gujarati language

2014 International Conference on Asian Language Processing (IALP) ◽

10.1109/ialp.2014.6973506 ◽

2014 ◽

Cited By ~ 2

Author(s):

Anshu Chittora ◽

Hemant A. Patil

Keyword(s):

Gujarati Language

Download Full-text

Psychometric Features of Dizziness Handicap Inventory (DHI): Development and Standardization in Gujarati Language

The International Tinnitus Journal ◽

10.5935/0946-5448.20190015 ◽

2019 ◽

Vol 23 (2) ◽

Author(s):

Anuj Kumar Neupane ◽

Arva Kapasi ◽

Nikheel Patel

Keyword(s):

Dizziness Handicap Inventory ◽

Gujarati Language

Download Full-text

Influence of GUJarati STEmmeR in Supervised Learning of Web Page Categorization

International Journal of Intelligent Systems and Applications ◽

10.5815/ijisa.2021.03.03 ◽

2021 ◽

Vol 13 (3) ◽

pp. 23-34

Author(s):

Chandrakant D. Patel ◽

◽

Jayesh M. Patel

Keyword(s):

Machine Learning ◽

Information Retrieval ◽

Research Work ◽

Research Problem ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Web Page ◽

User Query ◽

On Line ◽

Gujarati Language

With the large quantity of information offered on-line, it's equally essential to retrieve correct information for a user query. A large amount of data is available in digital form in multiple languages. The various approaches want to increase the effectiveness of on-line information retrieval but the standard approach tries to retrieve information for a user query is to go looking at the documents within the corpus as a word by word for the given query. This approach is incredibly time intensive and it's going to miss several connected documents that are equally important. So, to avoid these issues, stemming has been extensively utilized in numerous Information Retrieval Systems (IRS) to extend the retrieval accuracy of all languages. These papers go through the problem of stemming with Web Page Categorization on Gujarati language which basically derived the stem words using GUJSTER algorithms [1]. The GUJSTER algorithm is based on morphological rules which is used to derived root or stem word from inflected words of the same class. In particular, we consider the influence of extracted a stem or root word, to check the integrity of the web page classification using supervised machine learning algorithms. This research work is intended to focus on the analysis of Web Page Categorization (WPC) of Gujarati language and concentrate on a research problem to do verify the influence of a stemming algorithm in a WPC application for the Gujarati language with improved accuracy between from 63% to 98% through Machine Learning supervised models with standard ratio 80% as training and 20% as testing.

Download Full-text