Characteristics of the use of coupled hidden Markov models for audio-visual polish speech recognition

Bulletin of the Polish Academy of Sciences Technical Sciences ◽

10.2478/v10175-012-0041-6 ◽

2012 ◽

Vol 60 (2) ◽

pp. 307-316 ◽

Author(s):

M. Kubanek ◽

J. Bobulski ◽

L. Adrjanowicz

Keyword(s):

Speech Recognition ◽

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Visual Speech ◽

Visual Signals ◽

Visual Speech Recognition ◽

Coupled Hidden Markov Models ◽

Visual Characteristic ◽

Abstract. This paper focuses on combining audio-visual signals for Polish speech recognition in conditions of the highly disturbed audio speech signal. Recognition of audio-visual speech was based on combined hidden Markov models (CHMM). The described methods were developed for a single isolated command, nevertheless their effectiveness indicated that they would also work similarly in continuous audiovisual speech recognition. The problem of a visual speech analysis is very difficult and computationally demanding, mostly because of an extreme amount of data that needs to be processed. Therefore, the method of audio-video speech recognition is used only while the audiospeech signal is exposed to a considerable level of distortion. There are proposed the authors’ own methods of the lip edges detection and a visual characteristic extraction in this paper. Moreover, the method of fusing speech characteristics for an audio-video signal was proposed and tested. A significant increase of recognition effectiveness and processing speed were noted during tests - for properly selected CHMM parameters and an adequate codebook size, besides the use of the appropriate fusion of audio-visual characteristics. The experimental results were very promising and close to those achieved by leading scientists in the field of audio-visual speech recognition.

Download Full-text

Training Hidden Markov Models by Hybrid Simulated Annealing for Visual Speech Recognition

2006 IEEE International Conference on Systems, Man and Cybernetics ◽

10.1109/icsmc.2006.384382 ◽

2006 ◽

Author(s):

Jong-Seok Lee ◽

Cheol Hoon Park

Keyword(s):

Simulated Annealing ◽

Speech Recognition ◽

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Visual Speech ◽

Visual Speech Recognition ◽

Hybrid Simulated Annealing

Download Full-text

Discriminative training of hidden Markov models by multiobjective optimization for visual speech recognition

Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005. ◽

10.1109/ijcnn.2005.1556216 ◽

2006 ◽

Author(s):

Jong-Seok Lee ◽

Cheol Hoon Park

Keyword(s):

Speech Recognition ◽

Multiobjective Optimization ◽

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Discriminative Training ◽

Visual Speech ◽

Visual Speech Recognition

Download Full-text

Visual Speech Recognition Using Motion Features and Hidden Markov Models

Computer Analysis of Images and Patterns - Lecture Notes in Computer Science ◽

10.1007/978-3-540-74272-2_103 ◽

2007 ◽

pp. 832-839 ◽

Author(s):

Wai Chee Yau ◽

Dinesh Kant Kumar ◽

Hans Weghorn

Keyword(s):

Speech Recognition ◽

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Visual Speech ◽

Visual Speech Recognition ◽

Motion Features

Download Full-text

The Geometrical Based Lip-Reading Techniques of Multi-Dimensional Dynamic Time Warping MDTW and Hidden Markov Models HMMs in the Audio Visual Speech Recognition

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2020/68912020 ◽

2020 ◽

Vol 9 (1) ◽

pp. 496-504

Author(s):

Muhammad Ismail Mohmand

Keyword(s):

Speech Recognition ◽

Hidden Markov Models ◽

Dynamic Time Warping ◽

Markov Models ◽

Hidden Markov ◽

Visual Speech ◽

Time Warping ◽

Visual Speech Recognition ◽

Lip Reading ◽

Download Full-text

Recognition of Human Movements Using Hidden Markov Models - An Application to Visual Speech Recognition

Proceedings of the 7th International Workshop on Pattern Recognition in Information Systems ◽

10.5220/0002424901510160 ◽

2007 ◽

Keyword(s):

Speech Recognition ◽

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Visual Speech ◽

Visual Speech Recognition ◽

Human Movements

Download Full-text

Visual speech recognition using active shape models and hidden Markov models

1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings ◽

10.1109/icassp.1996.543246 ◽

2002 ◽

Author(s):

J. Luettin ◽

N.A. Thacker ◽

S.W. Beet

Keyword(s):

Speech Recognition ◽

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Visual Speech ◽

Active Shape Models ◽

Shape Models ◽

Visual Speech Recognition ◽

Download Full-text

Hybrid Simulated Annealing and Its Application to Optimization of Hidden Markov Models for Visual Speech Recognition

IEEE Transactions on Systems Man and Cybernetics Part B (Cybernetics) ◽

10.1109/tsmcb.2009.2036753 ◽

2010 ◽

Vol 40 (4) ◽

pp. 1188-1196 ◽

Author(s):

Jong-Seok Lee ◽

Cheol Hoon Park

Keyword(s):

Simulated Annealing ◽

Speech Recognition ◽

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Visual Speech ◽

Visual Speech Recognition ◽

Hybrid Simulated Annealing

Download Full-text

Audio-visual speech asynchrony detection using co-inertia analysis and coupled hidden markov models

Pattern Analysis and Applications ◽

10.1007/s10044-008-0121-2 ◽

2008 ◽

Vol 12 (3) ◽

pp. 271-284 ◽

Author(s):

Enrique Argones Rúa ◽

Hervé Bredin ◽

Carmen García Mateo ◽

Gérard Chollet ◽

Daniel González Jiménez

Keyword(s):

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Visual Speech ◽

Coupled Hidden Markov Models ◽

Asynchrony Detection

Download Full-text

Audio-visual speech modeling using coupled hidden Markov models

IEEE International Conference on Acoustics Speech and Signal Processing ◽

10.1109/icassp.2002.5745026 ◽

2002 ◽

Author(s):

Stephen M. Chu ◽

Thomas S. Huang

Keyword(s):

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Visual Speech ◽

Coupled Hidden Markov Models ◽

Speech Modeling

Download Full-text

Audio-visual speech modeling using coupled hidden Markov models

IEEE International Conference on Acoustics Speech and Signal Processing ◽

10.1109/icassp.2002.1006166 ◽

2002 ◽

Author(s):

Chu ◽

Huang

Keyword(s):

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Visual Speech ◽

Coupled Hidden Markov Models ◽

Speech Modeling

Download Full-text