Speaker identification based on the time-delay hierarchical mixture of experts

Author(s):  
Ke Chen ◽  
Dahong Xie ◽  
Huisheng Chi
1996 ◽  
Vol 07 (01) ◽  
pp. 29-43 ◽  
Author(s):  
KE CHEN ◽  
DAHONG XIE ◽  
HUISHENG CHI

In this paper, we extend the Hierarchical Mixture of Experts (HME) to temporal processing and explore it for a substantial problem, that of text-dependent speaker identification. For a specific multiway classification, we propose a generalized Bernoulli density instead of the multinomial logit density to avoid the instability during training. Time-delay technique is applied for spatio-temporal processing in the HME and a combining scheme is presented for combining multiple time-delay HMEs in order to complete a multi-scale analysis for the temporal data. Using the time-delay HME along with the EM algorithm as well as the combination of multiple time-delay HMEs, the speaker identification system has a good performance and yields significantly fast training. We have also addressed some issues about the time-delay techniques in the HME.


2006 ◽  
Vol 16 (4) ◽  
pp. 389-395 ◽  
Author(s):  
Woo-Kyung Choi ◽  
Sang-Hyung Ha ◽  
Seong-Joo Kim ◽  
Yong-Taek Kim ◽  
Hong-Tae Jeon

2021 ◽  
Vol 419 ◽  
pp. 148-156 ◽  
Author(s):  
Ozan İrsoy ◽  
Ethem Alpaydın

Sign in / Sign up

Export Citation Format

Share Document