scholarly journals Prosodic-Enhanced Siamese Convolutional Neural Networks for Cross-Device Text-Independent Speaker Verification

Author(s):  
Sobhan Soleymani ◽  
Ali Dabouei ◽  
Seyed Mehdi Iranmanesh ◽  
Hadi Kazemi ◽  
Jeremy Dawson ◽  
...  
THE BULLETIN ◽  
2020 ◽  
Vol 5 (387) ◽  
pp. 6-15
Author(s):  
O. Mamyrbayev ◽  
◽  
A. Akhmediyarova ◽  
A. Kydyrbekova ◽  
N. O. Mekebayev ◽  
...  

Biometrics offers more security and convenience than traditional methods of identification. Recently, DNN has become a means of a more reliable and efficient authentication scheme. In this work, we compare two modern teaching methods: these two methods are methods based on the Gaussian mixture model (GMM) (denoted by the GMM i-vector) and methods based on deep neural networks (DNN) (denoted as the i-vector DNN). The results show that the DNN system with an i-vector is superior to the GMM system with an i-vector for various durations (from full length to 5s). DNNs have proven to be the most effective features for text-independent speaker verification in recent studies. In this paper, a new scheme is proposed that allows using DNN when checking text using hints in a simple and effective way. Experiments show that the proposed scheme reduces EER by 24.32% compared with the modern method and is evaluated for its reliability using noisy data, as well as data collected in real conditions. In addition, it is shown that the use of DNN instead of GMM for universal background modeling leads to a decrease in EER by 15.7%.


Sign in / Sign up

Export Citation Format

Share Document