Hidden Markov Model Based Visemes Recognition, Part II

2009 ◽  
pp. 356-387
Author(s):  
Say Wei Foo ◽  
Liang Donga

The basic building blocks of visual speech are the visemes. Unlike phonemes, the visemes are, however, confusable and easily distorted by the contexts in which they appear. Classifiers capable of distinguishing the minute difference among the different categories are desirable. In this chapter, we describe two Hidden Markov Model based techniques using the discriminative approach to increase the accuracy of visual speech recognition. The approaches investigated include Maximum Separable Distance (MSD) training strategy (Dong, 2005) and Two-channel training approach (Dong, 2005; Foo, 2003; Foo, 2002) The MSD training strategy and the Two-channel training approach adopt a proposed criterion function called separable distance to improve the discriminative power of an HMM. The methods are applied to identify confusable visemes. Experimental results indicate that higher recognition accuracy can be attained using these approaches than that using conventional HMM.

2009 ◽  
pp. 326-355
Author(s):  
Say Wei Foo ◽  
Liang Donga

Visual speech recognition is able to supplement the information of speech sound to improve the accuracy of speech recognition. A viseme, which describes the facial and oral movements that occur alongside the voicing of a particular phoneme, is a supposed basic unit of speech in the visual domain. As in phonemes, there are variations for the same viseme expressed by different persons or even by the same person. A classifier must be robust to this kind of variation. In this chapter, the author’s describe the Adaptively Boosted (AdaBoost) Hidden Markov Model (HMM) technique (Foo, 2004; Foo, 2003; Dong, 2002). By applying the AdaBoost technique to HMM modeling, a multi-HMM classifier that improves the robustness of HMM is obtained. The method is applied to identify context-independent and contextdependent visual speech units. Experimental results indicate that higher recognition accuracy can be attained using the AdaBoost HMM than that using conventional HMM.


2018 ◽  
Vol 106 (4) ◽  
pp. 2129-2147
Author(s):  
Usha Sharma ◽  
Sushila Maheshkar ◽  
A. N. Mishra ◽  
Rahul Kaushik

Bioacoustics ◽  
2019 ◽  
Vol 29 (2) ◽  
pp. 140-167 ◽  
Author(s):  
Susannah J. Buchan ◽  
Rodrigo Mahú ◽  
Jorge Wuth ◽  
Naysa Balcazar-Cabrera ◽  
Laura Gutierrez ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document