Performance of speaker recognition system using shifted mfcc, delta spectral cepstral coefficient (DSCC) and Fuzzy techniques

Speech and speaker recognition systems are biometric inspired systems which are having scope in various online and offline applications. In case of biometric we ponder the variability of speech signal due to the presence of noise which greatly degrades the efficiency of Automatic Speaker Recognition (ASR) in real-world environmental circumstances. Real world speech signal is degraded by different types of noise signals like background noise, interference noise and crosstalk noise. In this paper, we have used Delta Spectrum Cepstrum Coefficient (DSCC) and Shifted MFCC with fuzzy modeling techniques to rectify the deed of ASR even in a noisy surrounding with the help of upgraded speech information which is present at high frequency in the spectral domain. The combination of fuzzy modeling and DSCC creates a firm cumulative algorithm which has reasonably high robustness to noise. Experimental results show that accuracy has enhanced by 10-20% even at 5-8dB SNR in the presence of background noise or turbulent environmental condition or in the presence of white noise.Thus proposed model has improved maturity level in comparison to obsolete methods.

Download Full-text

Automatic Speaker Recognition System

10.21236/ada197980 ◽

1984 ◽

Author(s):

Alan Higgins ◽

Joe Naylor

Keyword(s):

Speaker Recognition ◽

Recognition System ◽

Automatic Speaker Recognition

Download Full-text

Automatic Speaker Recognition by Speech Signal

Frontiers in Robotics, Automation and Control ◽

10.5772/6333 ◽

2008 ◽

Cited By ~ 2

Author(s):

Milan Sigmund

Keyword(s):

Speaker Recognition ◽

Speech Signal ◽

Automatic Speaker Recognition

Download Full-text

The assessment of efficiency of the automatic speaker recognition system for voices registered using a throat microphone

XII Conference on Reconnaissance and Electronic Warfare Systems ◽

10.1117/12.2524591 ◽

2019 ◽

Author(s):

Kamil Kamiński ◽

Andrzej P. Dobrowolski ◽

Rafał Tatoń

Keyword(s):

Speaker Recognition ◽

Recognition System ◽

Automatic Speaker Recognition

Download Full-text

Automatic Speaker Recognition System based on Optimised Machine Learning Algorithms

2019 IEEE AFRICON ◽

10.1109/africon46755.2019.9133823 ◽

2019 ◽

Author(s):

Tumisho Billson Mokgonyane ◽

Tshephisho Joseph Sefara ◽

Thipe Isaiah Modipa ◽

Madimetja Jonas Manamela

Keyword(s):

Machine Learning ◽

Speaker Recognition ◽

Learning Algorithms ◽

Recognition System ◽

Machine Learning Algorithms ◽

Automatic Speaker Recognition

Download Full-text

Text Independent Automatic Speaker Recognition System using fusion of features

PRZEGLĄD ELEKTROTECHNICZNY ◽

10.15199/48.2015.10.52 ◽

2015 ◽

Vol 1 (10) ◽

pp. 249-253 ◽

Cited By ~ 1

Author(s):

Ewelina MAJDA-ZDANCEWICZ

Keyword(s):

Speaker Recognition ◽

Recognition System ◽

Automatic Speaker Recognition

Download Full-text

THE AUTOMATIC SPEAKER RECOGNITION SYSTEM OF CRITICAL USE CLASSIFIER OPTIMIZATION

Radio Electronics Computer Science Control ◽

10.15588/1607-3274-2018-2-4 ◽

2018 ◽

Vol 0 (2) ◽

Cited By ~ 1

Author(s):

O. V Bisikalo ◽

T. V. Grischuk ◽

V. V. Kovtun

Keyword(s):

Speaker Recognition ◽

Recognition System ◽

Automatic Speaker Recognition

Download Full-text

Bayesian distance metric learning and its application in automatic speaker recognition systems

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v9i4.pp2960-2967 ◽

2019 ◽

Vol 9 (4) ◽

pp. 2960

Author(s):

Satyanand Singh

Keyword(s):

Distance Learning ◽

Covariance Matrix ◽

Speaker Recognition ◽

Metric Learning ◽

Recognition System ◽

Training Data ◽

Distance Metric ◽

Automatic Speaker Recognition ◽

Data Pair ◽

Metric Distance

This paper proposes state-of the-art Automatic Speaker Recognition System (ASR) based on Bayesian Distance Learning Metric as a feature extractor. In this modeling, I explored the constraints of the distance between modified and simplified i-vector pairs by the same speaker and different speakers. An approximation of the distance metric is used as a weighted covariance matrix from the higher eigenvectors of the covariance matrix, which is used to estimate the posterior distribution of the metric distance. Given a speaker tag, I select the data pair of the different speakers with the highest cosine score to form a set of speaker constraints. This collection captures the most discriminating variability between the speakers in the training data. This Bayesian distance learning approach achieves better performance than the most advanced methods. Furthermore, this method is insensitive to normalization compared to cosine scores. This method is very effective in the case of limited training data. The modified supervised i-vector based ASR system is evaluated on the NIST SRE 2008 database. The best performance of the combined cosine score EER 1.767% obtained using LDA200 + NCA200 + LDA200, and the best performance of Bayes_dml EER 1.775% obtained using LDA200 + NCA200 + LDA100. Bayesian_dml overcomes the combined norm of cosine scores and is the best result of the short2-short3 condition report for NIST SRE 2008 data.

Download Full-text