Development and analysis of multilingual phone recognition systems using Indian languages

In this study, we evaluate and compare two different approaches for multilingual phone recognition in code-switched and non-code-switched scenarios. First approach is a front-end Language Identification (LID)-switched to a monolingual phone recognizer (LID-Mono), trained individually on each of the languages present in multilingual dataset. In the second approach, a common multilingual phone-set derived from the International Phonetic Alphabet (IPA) transcription of the multilingual dataset is used to develop a Multilingual Phone Recognition System (Multi-PRS). The bilingual code-switching experiments are conducted using Kannada and Urdu languages. In the first approach, LID is performed using the state-of-the-art i-vectors. Both monolingual and multilingual phone recognition systems are trained using Deep Neural Networks. The performance of LID-Mono and Multi-PRS approaches are compared and analysed in detail. It is found that the performance of Multi-PRS approach is superior compared to more conventional LID-Mono approach in both code-switched and non-code-switched scenarios. For code-switched speech, the effect of length of segments (that are used to perform LID) on the performance of LID-Mono system is studied by varying the window size from 500 ms to 5.0 s, and full utterance. The LID-Mono approach heavily depends on the accuracy of the LID system and the LID errors cannot be recovered. But, the Multi-PRS system by virtue of not having to do a front-end LID switching and designed based on the common multilingual phone-set derived from several languages, is not constrained by the accuracy of the LID system, and hence performs effectively on code-switched and non-code-switched speech, offering low Phone Error Rates than the LID-Mono system.

Download Full-text

Articulatory-feature-based methods for performance improvement of Multilingual Phone Recognition Systems using Indian languages

Sadhana ◽

10.1007/s12046-020-01428-9 ◽

2020 ◽

Vol 45 (1) ◽

Author(s):

K E Manjunath ◽

Dinesh Babu Jayagopi ◽

K Sreenivasa Rao ◽

V Ramasubramanian

Keyword(s):

Performance Improvement ◽

Indian Languages ◽

Phone Recognition ◽

Feature Based ◽

Recognition Systems

Download Full-text

Enhancement of Speech Recognition System by neural network approaches of Clustering

INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY ◽

10.24297/ijct.v6i1.4456 ◽

2013 ◽

Vol 6 (1) ◽

pp. 266-271

Author(s):

Anurag Upadhyay ◽

Chitranjanjit Kaur

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Recurrent Neural Networks ◽

Alternative Energy ◽

Recognition Rate ◽

Speech Sound ◽

Recognition System ◽

Training Methods ◽

Indian Languages ◽

Phone Recognition

This paper addresses the problem of speech recognition to identify various modes of speech data. Speaker sounds are the acoustic sounds of speech. Statistical models of speech have been widely used for speech recognition under neural networks. In paper we propose and try to justify a new model in which speech co articulation the effect of phonetic context on speech sound is modeled explicitly under a statistical framework. We study speech phone recognition by recurrent neural networks and SOUL Neural Networks. A general framework for recurrent neural networks and considerations for network training are discussed in detail. SOUL NN clustering the large vocabulary that compresses huge data sets of speech. This project also different Indian languages utter by different speakers in different modes such as aggressive, happy, sad, and angry. Many alternative energy measures and training methods are proposed and implemented. A speaker independent phone recognition rate of 82% with 25% frame error rate has been achieved on the neural data base. Neural speech recognition experiments on the NTIMIT database result in a phone recognition rate of 68% correct. The research results in this thesis are competitive with the best results reported in the literature.Â

Download Full-text

A Zone Based Approach for Classification and Recognition Of Telugu Handwritten Characters

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v6i4.10553 ◽

2016 ◽

Vol 6 (4) ◽

pp. 1647

Author(s):

N. Shobha Rani ◽

Sanjay Kumar Verma ◽

Anitta Joseph

Keyword(s):

Feature Extraction ◽

Character Recognition ◽

Optical Character Recognition ◽

Vital Role ◽

Indian Languages ◽

Processing Application ◽

Image Objects ◽

South Indian ◽

Recognition Systems ◽

The Times

Realization of high accuracies and efficiencies in South Indian character recognition systems is one of the principle goals to be attempted time after time so as to promote the usage of optical character recognition (OCR) for South Indian languages like Telugu. The process of character recognition comprises pre-processing, segmentation, feature extraction, classification and recognition. The feature extraction stage is meant for uniquely recognizing each character image for the purpose of classifying it. The selection of a feature extraction algorithm is very critical and important for any image processing application and mostly of the times it is directly proportional to the type of the image objects that we have to identify. For optical technologies like South Indian OCR, the feature extraction technique plays a very vital role in accuracy of recognition due to the huge character sets. In this work we mainly focus on evaluating the performance of various feature extraction techniques with respect to Telugu character recognition systems and analyze its efficiencies and accuracies in recognition of Telugu character set.

Download Full-text

WAVELET DESCRIPTORS FOR RECOGNITION OF BASIC SYMBOLS IN PRINTED KANNADA TEXT

International Journal of Wavelets Multiresolution and Information Processing ◽

10.1142/s0219691307001793 ◽

2007 ◽

Vol 05 (02) ◽

pp. 351-367 ◽

Cited By ~ 12

Author(s):

R. SANJEEV KUNTE ◽

R. D. SUDHAKER SAMUEL

Keyword(s):

Character Recognition ◽

Optical Character Recognition ◽

Indian Languages ◽

Indian Language ◽

South Indian ◽

Wavelet Features ◽

On Line ◽

Recognition Systems ◽

System Methodology

Optical Character Recognition (OCR) systems have been effectively developed for the recognition of printed characters of non-Indian languages. Efforts are underway for the development of efficient OCR systems for Indian languages, especially for Kannada, a popular South Indian language. We present in this paper an OCR system developed for the recognition of basic characters in printed Kannada text, which can handle different font sizes and font sets. Wavelets that have been progressively used in pattern recognition and on-line character recognition systems are used in our system to extract the features of printed Kannada characters. Neural classifiers have been effectively used for the classification of characters based on wavelet features. The system methodology can be extended for the recognition of other south Indian languages, especially for Telugu.

Download Full-text

Indian Languages ASR: A Multilingual Phone Recognition Framework with IPA Based Common Phone-set, Predicted Articulatory Features and Feature fusion

10.21437/interspeech.2018-2529 ◽

2018 ◽

Author(s):

Manjunath K E ◽

K. Sreenivasa Rao ◽

Dinesh Babu Jayagopi ◽

V Ramasubramanian

Keyword(s):

Feature Fusion ◽

Indian Languages ◽

Phone Recognition ◽

Articulatory Features

Download Full-text

Multilingual and multimode phone recognition system for Indian languages

Speech Communication ◽

10.1016/j.specom.2020.02.006 ◽

2020 ◽

Vol 119 ◽

pp. 12-23

Author(s):

Kumud Tripathi ◽

M. Kiran Reddy ◽

K. Sreenivasa Rao

Keyword(s):

Recognition System ◽

Indian Languages ◽

Phone Recognition

Download Full-text

Development of Consonant-Vowel Recognition Systems for Indian languages: Bengali and Odia

2013 Annual IEEE India Conference (INDICON) ◽

10.1109/indcon.2013.6726109 ◽

2013 ◽

Cited By ~ 9

Author(s):

K E Manjunath ◽

S. B. Sunil Kumar ◽

Debadatta Pati ◽

Biswajit Satapathy ◽

K. Sreenivasa Rao

Keyword(s):

Indian Languages ◽

Recognition Systems ◽

Vowel Recognition

Download Full-text

A Zone Based Approach for Classification and Recognition Of Telugu Handwritten Characters

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v6i4.pp1647-1653 ◽

2016 ◽

Vol 6 (4) ◽

pp. 1647

Author(s):

N. Shobha Rani ◽

Sanjay Kumar Verma ◽

Anitta Joseph

Keyword(s):

Feature Extraction ◽

Character Recognition ◽

Optical Character Recognition ◽

Vital Role ◽

Indian Languages ◽

Processing Application ◽

Image Objects ◽

South Indian ◽

Recognition Systems ◽

The Times

Realization of high accuracies and efficiencies in South Indian character recognition systems is one of the principle goals to be attempted time after time so as to promote the usage of optical character recognition (OCR) for South Indian languages like Telugu. The process of character recognition comprises pre-processing, segmentation, feature extraction, classification and recognition. The feature extraction stage is meant for uniquely recognizing each character image for the purpose of classifying it. The selection of a feature extraction algorithm is very critical and important for any image processing application and mostly of the times it is directly proportional to the type of the image objects that we have to identify. For optical technologies like South Indian OCR, the feature extraction technique plays a very vital role in accuracy of recognition due to the huge character sets. In this work we mainly focus on evaluating the performance of various feature extraction techniques with respect to Telugu character recognition systems and analyze its efficiencies and accuracies in recognition of Telugu character set.

Download Full-text

Multilingual Phone Recognition in Indian Languages

10.1007/978-3-030-80741-2 ◽

2022 ◽

Author(s):

K.E Manjunath

Keyword(s):

Indian Languages ◽

Phone Recognition

Download Full-text