Development and Analysis of Multilingual Phone Recognition System

In this study, we evaluate and compare two different approaches for multilingual phone recognition in code-switched and non-code-switched scenarios. First approach is a front-end Language Identification (LID)-switched to a monolingual phone recognizer (LID-Mono), trained individually on each of the languages present in multilingual dataset. In the second approach, a common multilingual phone-set derived from the International Phonetic Alphabet (IPA) transcription of the multilingual dataset is used to develop a Multilingual Phone Recognition System (Multi-PRS). The bilingual code-switching experiments are conducted using Kannada and Urdu languages. In the first approach, LID is performed using the state-of-the-art i-vectors. Both monolingual and multilingual phone recognition systems are trained using Deep Neural Networks. The performance of LID-Mono and Multi-PRS approaches are compared and analysed in detail. It is found that the performance of Multi-PRS approach is superior compared to more conventional LID-Mono approach in both code-switched and non-code-switched scenarios. For code-switched speech, the effect of length of segments (that are used to perform LID) on the performance of LID-Mono system is studied by varying the window size from 500 ms to 5.0 s, and full utterance. The LID-Mono approach heavily depends on the accuracy of the LID system and the LID errors cannot be recovered. But, the Multi-PRS system by virtue of not having to do a front-end LID switching and designed based on the common multilingual phone-set derived from several languages, is not constrained by the accuracy of the LID system, and hence performs effectively on code-switched and non-code-switched speech, offering low Phone Error Rates than the LID-Mono system.

Download Full-text

Enhancement of Speech Recognition System by neural network approaches of Clustering

INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY ◽

10.24297/ijct.v6i1.4456 ◽

2013 ◽

Vol 6 (1) ◽

pp. 266-271

Author(s):

Anurag Upadhyay ◽

Chitranjanjit Kaur

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Recurrent Neural Networks ◽

Alternative Energy ◽

Recognition Rate ◽

Speech Sound ◽

Recognition System ◽

Training Methods ◽

Indian Languages ◽

Phone Recognition

This paper addresses the problem of speech recognition to identify various modes of speech data. Speaker sounds are the acoustic sounds of speech. Statistical models of speech have been widely used for speech recognition under neural networks. In paper we propose and try to justify a new model in which speech co articulation the effect of phonetic context on speech sound is modeled explicitly under a statistical framework. We study speech phone recognition by recurrent neural networks and SOUL Neural Networks. A general framework for recurrent neural networks and considerations for network training are discussed in detail. SOUL NN clustering the large vocabulary that compresses huge data sets of speech. This project also different Indian languages utter by different speakers in different modes such as aggressive, happy, sad, and angry. Many alternative energy measures and training methods are proposed and implemented. A speaker independent phone recognition rate of 82% with 25% frame error rate has been achieved on the neural data base. Neural speech recognition experiments on the NTIMIT database result in a phone recognition rate of 68% correct. The research results in this thesis are competitive with the best results reported in the literature.Â

Download Full-text

Two-stage phone recognition system using articulatory and spectral features

2015 International Conference on Signal Processing and Communication Engineering Systems ◽

10.1109/spaces.2015.7058226 ◽

2015 ◽

Cited By ~ 4

Author(s):

K E Manjunath ◽

K. Sreenivasa Rao ◽

M Gurunath Reddy

Keyword(s):

Recognition System ◽

Spectral Features ◽

Two Stage ◽

Phone Recognition

Download Full-text

Multilingual and multimode phone recognition system for Indian languages

Speech Communication ◽

10.1016/j.specom.2020.02.006 ◽

2020 ◽

Vol 119 ◽

pp. 12-23

Author(s):

Kumud Tripathi ◽

M. Kiran Reddy ◽

K. Sreenivasa Rao

Keyword(s):

Recognition System ◽

Indian Languages ◽

Phone Recognition

Download Full-text

Mizo Phone Recognition System

2017 14th IEEE India Council International Conference (INDICON) ◽

10.1109/indicon.2017.8487726 ◽

2017 ◽

Cited By ~ 1

Author(s):

Abhishek Dey ◽

Wendy Lalhminghlui ◽

Priyankoo Sarmah ◽

K. Samudravijaya ◽

S. R. Mahadeva Prasarma ◽

...

Keyword(s):

Recognition System ◽

Phone Recognition

Download Full-text

Pronunciation scoring for Indian English learners using a phone recognition system

Proceedings of the First International Conference on Intelligent Interactive Technologies and Multimedia - IITM '10 ◽

10.1145/1963564.1963587 ◽

2010 ◽

Cited By ~ 2

Author(s):

Chitralekha Bhat ◽

K. L. Srinivas ◽

Preeti Rao

Keyword(s):

English Learners ◽

Recognition System ◽

Indian English ◽

Phone Recognition

Download Full-text

Development of multilingual phone recognition system for Indian languages

2017 IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems (SPICES) ◽

10.1109/spices.2017.8091271 ◽

2017 ◽

Cited By ~ 3

Author(s):

K E Manjunath ◽

K. Sreenivasa Rao ◽

Dinesh Babu Jayagopi

Keyword(s):

Recognition System ◽

Indian Languages ◽

Phone Recognition

Download Full-text

Picture naming reveals the major invariances expected of a shape recognition system

PsycEXTRA Dataset ◽

10.1037/e665402011-411 ◽

1991 ◽

Author(s):

Irving Biederman ◽

Eric E. Cooper ◽

Peter C. Gerhardstein

Keyword(s):

Picture Naming ◽

Shape Recognition ◽

Recognition System

Download Full-text

IRIS AND FINGER VEIN MULTI MODEL RECOGNITION SYSTEM BASED ON SIFT FEATURES

Journal of Advanced Sciences and Engineering Technologies ◽

10.32441/jaset.v1i2.119 ◽

2018 ◽

Vol 1 (2) ◽

pp. 34-44

Author(s):

Faris E Mohammed ◽

Dr. Eman M ALdaidamony ◽

Prof. A. M Raid

Keyword(s):

Iris Recognition ◽

Recognition Performance ◽

Recognition System ◽

Individual Identification ◽

Work Place ◽

Identification Process ◽

Finger Vein ◽

Noise Point ◽

Vein Recognition ◽

A New Technique

Individual identification process is a very significant process that resides a large portion of day by day usages. Identification process is appropriate in work place, private zones, banks …etc. Individuals are rich subject having many characteristics that can be used for recognition purpose such as finger vein, iris, face …etc. Finger vein and iris key-points are considered as one of the most talented biometric authentication techniques for its security and convenience. SIFT is new and talented technique for pattern recognition. However, some shortages exist in many related techniques, such as difficulty of feature loss, feature key extraction, and noise point introduction. In this manuscript a new technique named SIFT-based iris and SIFT-based finger vein identification with normalization and enhancement is proposed for achieving better performance. In evaluation with other SIFT-based iris or SIFT-based finger vein recognition algorithms, the suggested technique can overcome the difficulties of tremendous key-point extraction and exclude the noise points without feature loss. Experimental results demonstrate that the normalization and improvement steps are critical for SIFT-based recognition for iris and finger vein , and the proposed technique can accomplish satisfactory recognition performance. Keywords: SIFT, Iris Recognition, Finger Vein identification and Biometric Systems. © 2018 JASET, International Scholars and Researchers Association

Download Full-text

Thai Buddhist Amulet Recognition System

International Academy of Engineers (IA-E) Dec. 30-31, 2014 Bangkok, Thailand ◽

10.15242/iae.iae1214005 ◽

2014 ◽

Keyword(s):

Recognition System

Download Full-text