scholarly journals Feature Level Solution to Noise Robust Speech Recognition in the context of Tonal Languages

Performance of a speech recognition system is highly dependent on the operational environments. The mismatched ambient conditions have adverse impact on the performance of an Automatic Speech Recognition (ASR) system. The speech parameterization techniques for tonal speech recognition are different from those used for non-tonal speech recognition. It is due to the fact that tonal speech has two components – basic linguistic unit and tone. The basic linguistic unit with different tones convey different meanings. Therefore, the feature set used for tonal speech recognition must have the capability to representing both of them. Tone is determined by the fundamental frequency of the speech signal which is highly sensitive to noise. Since at the time of parameterization of the non-tonal speech recognition systems, these highly noise-sensitive tone related information are discarded, the traditional noise elimination methods used for non-tonal speech recognition fail to deliver robust performance in tonal speech recognition. In the present study, we have analyze the performance of different commonly used feature sets for noisy tonal speech recognition. Hidden Markov Model (HMM) based speech recognizer has been used for performance evaluation. Noise elimination techniques sub-band spectral subtraction and Wiener filter have been used for noise reduction and their relative performance have been evaluated.

2011 ◽  
Vol 217-218 ◽  
pp. 413-418
Author(s):  
Xue Mei Hou

Considering the actuality of current speech recognition and the characteristic of RBF neural network, a noise-robust speech recognition system based on RBF neural network is proposed with the entire-supervised algorithm. If the traditional clustering algorithm is employed, there is a flaw that the node center of hidden layer is always sensitive to the initial value, but if the entire-supervised algorithm is used, the flaw will not turn up, and the classification ability of RBF network will be enhanced. Experimental results show that, compared with the traditional clustering algorithm, the entire-supervised algorithm is of higher recognition rate in different SNRs than that of clustering algorithm.


2004 ◽  
Vol 25 (2) ◽  
pp. 166-169 ◽  
Author(s):  
Shunsuke Ishimitsu ◽  
Hironori Kitakaze ◽  
Yasuyuki Tsuchibushi ◽  
Hirofumi Yanagawa ◽  
Manabu Fukushima

Sign in / Sign up

Export Citation Format

Share Document