Speaker Verification with Fuzzy Fusion and Genetic Optimization

Author(s):  
Tuan Pham ◽  
◽  
Michael Wagner ◽  

Most speaker verification systems are based on similarity or likelihood normalization techniques as they help to better cope with speaker variability. In the conventional normalization, the it a priori probabilities of the cohort speakers are assumed to be equal. From this standpoint, we apply the fuzzy integral and genetic algorithms to combine the likelihood values of the cohort speakers in which the assumption of equal <I>a priori</I> probabilities is relaxed. This approach replaces the conventional normalization term by the fuzzy integral which acts as a non-linear fusion of the similarity measures of an utterance assigned to the cohort speakers. Furthermore, genetic algorithms are applied to find optimal fuzzy densities which are very important for the fuzzy fusion. We illustrate the performance of the proposed approach by testing the speaker verification system with both the conventional and the proposed algorithms using the commercial speech corpus TI46. The results in terms of the equal error rates show that the speaker verification system using the fuzzy integral is more favorable than the conventional normalization method.

2004 ◽  
Vol 14 (06) ◽  
pp. 347-354
Author(s):  
LI GUOJIE ◽  
P. SARATCHANDRAN ◽  
N. SUNDARARAJAN

This paper presents a text-independent speaker verification system based on an online Radial Basis Function (RBF) network referred to as Minimal Resource Allocation Network (MRAN). MRAN is a sequential learning RBF, in which hidden neurons are added or removed as training progresses. LP-derived cepstral coefficients are used as feature vectors during training and verification phases. The performance of MRAN is compared with other well-known RBF and Elliptical Basis Function (EBF) based speaker verification methods in terms of error rates and computational complexity on a series of speaker verification experiments. The experiments use data from 258 speakers from the phonetically balancedcontinuous speech corpus TIMIT. The results show that MRAN produces comparable error rates to other methods with much less computational complexity.


2020 ◽  
Author(s):  
Ying Tong ◽  
Wei Xue ◽  
Shanluo Huang ◽  
Lu Fan ◽  
Chao Zhang ◽  
...  

2020 ◽  
Author(s):  
Kong Aik Lee ◽  
Koji Okabe ◽  
Hitoshi Yamamoto ◽  
Qiongqiong Wang ◽  
Ling Guo ◽  
...  

Author(s):  
Soonshin Seo ◽  
Daniel Jun Rim ◽  
Minkyu Lim ◽  
Donghyun Lee ◽  
Hosung Park ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document