scholarly journals Bayesian distance metric learning and its application in automatic speaker recognition systems

Author(s):  
Satyanand Singh

This paper proposes state-of the-art Automatic Speaker Recognition System (ASR) based on Bayesian Distance Learning Metric as a feature extractor. In this modeling, I explored the constraints of the distance between modified and simplified i-vector pairs by the same speaker and different speakers. An approximation of the distance metric is used as a weighted covariance matrix from the higher eigenvectors of the covariance matrix, which is used to estimate the posterior distribution of the metric distance. Given a speaker tag, I select the data pair of the different speakers with the highest cosine score to form a set of speaker constraints. This collection captures the most discriminating variability between the speakers in the training data. This Bayesian distance learning approach achieves better performance than the most advanced methods. Furthermore, this method is insensitive to normalization compared to cosine scores. This method is very effective in the case of limited training data. The modified supervised i-vector based ASR system is evaluated on the NIST SRE 2008 database. The best performance of the combined cosine score EER 1.767% obtained using LDA200 + NCA200 + LDA200, and the best performance of Bayes_dml EER 1.775% obtained using LDA200 + NCA200 + LDA100. Bayesian_dml overcomes the combined norm of cosine scores and is the best result of the short2-short3 condition report for NIST SRE 2008 data.

2018 ◽  
Vol 7 (2.32) ◽  
pp. 405
Author(s):  
L Lavanya ◽  
Chebrolu Ujwala Pavani ◽  
Gadchanda Vineeth ◽  
Borada Lavanya

Distance learning is an eminent technique that improves the search for images based on content. Although widely studied, most DML approaches generally recognize a modalization training framework that teaches a metric distance or a combination of distances in which several types of characteristics are simply interconnected. DML methods of that type suffer some critical limitations (a) Some feature types can significantly overwhelm others with the DML assignment, due to different attributes, and (b) the distance learning standard in the combined metric properties can be consumed using the feature attribute approach combined. In this article we refer to these the restrictions are reviewed online- multimodal distance metric training scheme (OMDML), which explores a dual duplication online learning scheme. (c) learn to optimize the distance metric in each owner space separately; and (d) learn find the optimal combination of different types of characteristics. To overestimate the cost of DML in sophisticated areas, we offer a low level OMDML algorithm that not only reduces estimated costs, but also guarantees high accuracy. We are here carried out exhaustive experiments to estimate the performance of the algorithms proposed for the restoration of multimedia images.  


Author(s):  
Tumisho Billson Mokgonyane ◽  
Tshephisho Joseph Sefara ◽  
Thipe Isaiah Modipa ◽  
Madimetja Jonas Manamela

Author(s):  
DEBASHISH DEV MISHRA ◽  
UTPAL BHATTACHARJEE ◽  
SHIKHAR KUMAR SARMA

The performance of automatic speaker recognition (ASR) system degrades drastically in the presence of noise and other distortions, especially when there is a noise level mismatch between the training and testing environments. This paper explores the problem of speaker recognition in noisy conditions, assuming that speech signals are corrupted by noise. A major problem of most speaker recognition systems is their unsatisfactory performance in noisy environments. In this experimental research, we have studied a combination of Mel Frequency Cepstral Coefficients (MFCC) for feature extraction and Cepstral Mean Normalization (CMN) techniques for speech enhancement. Our system uses a Gaussian Mixture Models (GMM) classifier and is implemented under MATLAB®7 programming environment. The process involves the use of speaker data for both training and testing. The data used for testing is matched up against a speaker model, which is trained with the training data using GMM modeling. Finally, experiments are carried out to test the new model for ASR given limited training data and with differing levels and types of realistic background noise. The results have demonstrated the robustness of the new system.


Author(s):  
Shuo Chen ◽  
Chen Gong ◽  
Jian Yang ◽  
Ying Tai ◽  
Le Hui ◽  
...  

The central problem for most existing metric learning methods is to find a suitable projection matrix on the differences of all pairs of data points. However, a single unified projection matrix can hardly characterize all data similarities accurately as the practical data are usually very complicated, and simply adopting one global projection matrix might ignore important local patterns hidden in the dataset. To address this issue, this paper proposes a novel method dubbed “Data-Adaptive Metric Learning” (DAML), which constructs a data-adaptive projection matrix for each data pair by selectively combining a set of learned candidate matrices. As a result, every data pair can obtain a specific projection matrix, enabling the proposed DAML to flexibly fit the training data and produce discriminative projection results. The model of DAML is formulated as an optimization problem which jointly learns candidate projection matrices and their sparse combination for every data pair. Nevertheless, the over-fitting problem may occur due to the large amount of parameters to be learned. To tackle this issue, we adopt the Total Variation (TV) regularizer to align the scales of data embedding produced by all candidate projection matrices, and thus the generated metrics of these learned candidates are generally comparable. Furthermore, we extend the basic linear DAML model to the kernerlized version (denoted “KDAML”) to handle the non-linear cases, and the Iterative Shrinkage-Thresholding Algorithm (ISTA) is employed to solve the optimization model. Intensive experimental results on various applications including retrieval, classification, and verification clearly demonstrate the superiority of our algorithm to other state-of-the-art metric learning methodologies.


Sign in / Sign up

Export Citation Format

Share Document