automatic speaker recognition Recently Published Documents

Efficient Manner ◽

Feature Extraction Method ◽

Input Signals ◽

Minimum Dimension

In this research, we present an automatic speaker recognition system based on adaptive orthogonal transformations. To obtain the informative features with a minimum dimension from the input signals, we created an adaptive operator, which helped to identify the speaker’s voice in a fast and efficient manner. We test the efficiency and the performance of our method by comparing it with another approach, mel-frequency cepstral coefficients (MFCCs), which is widely used by researchers as their feature extraction method. The experimental results show the importance of creating the adaptive operator, which gives added value to the proposed approach. The performance of the system achieved 96.8% accuracy using Fourier transform as a compression method and 98.1% using Correlation as a compression method.

Voice Pathologies Classification Using GMM And SVM Classifiers

International Journal of Mathematics and Computers in Simulation ◽

10.46300/9102.2021.15.21 ◽

2021 ◽

Vol 15 ◽

pp. 110-114

Author(s):

Amara Fethi ◽

Fezari Mohamed

Keyword(s):

Speaker Recognition ◽

Gaussian Mixture ◽

Polynomial Kernel ◽

Support Vector ◽

Classification Rate ◽

Statistical Pattern ◽

Voice Pathologies ◽

Sensitivity Specificity

In this paper we investigate the proprieties of automatic speaker recognition (ASR) to develop a system for voice pathologies detection, where the model does not correspond to a speaker but it corresponds to group of patients who shares the same diagnostic. One of essential part in this topic is the database (described later), the samples voices (healthy and pathological) are chosen from a German database which contains many diseases, spasmodic dysphonia is proposed for this study. This problematic can be solved by statistical pattern recognition techniques where we have proposed the mel frequency cepstral coefficients (MFCC) to be modeled first, with gaussian mixture model (GMM) massively used in ASR then, they are modeled with support vector machine (SVM). The obtained results are compared in order to evaluate the more preferment classifier. The performance of each method is evaluated in a term of the accuracy, sensitivity, specificity. The best performance is obtained with 12 coefficientsMFCC, energy and second derivate along SVM with a polynomial kernel function, the classification rate is 90% for normal class and 93% for pathological class.This work is developed under MATLAB

A Comparison of the Accuracy of Dissen and Keshet’s (2016) DeepFormants and Traditional LPC Methods for Semi-Automatic Speaker Recognition

10.21437/interspeech.2021-1487 ◽

2021 ◽

Author(s):

Thomas Coy ◽

Vincent Hughes ◽

Philip Harrison ◽

Amelia J. Gully

Keyword(s):

Speaker Recognition ◽

Automatic Speaker Recognition

Robust speaker verification by combining MFCC and entrocy in noisy conditions

Bulletin of Electrical Engineering and Informatics ◽

10.11591/eei.v10i4.2957 ◽

2021 ◽

Vol 10 (4) ◽

pp. 2310-2319

Author(s):

Duraid Y. Mohammed ◽

Khamis Al-Karawi ◽

Ahmed Aljuboori

Keyword(s):

Speaker Recognition ◽

Speaker Verification ◽

Gaussian Mixture ◽

Robust Speaker Recognition ◽

Noisy Conditions ◽

New Feature ◽

Highly Correlated ◽

The Fourier Transform

Automatic speaker recognition may achieve remarkable performance in matched training and test conditions. Conversely, results drop significantly in incompatible noisy conditions. Furthermore, feature extraction significantly affects performance. Mel-frequency cepstral coefficients MFCCs are most commonly used in this field of study. The literature has reported that the conditions for training and testing are highly correlated. Taken together, these facts support strong recommendations for using MFCC features in similar environmental conditions (train/test) for speaker recognition. However, with noise and reverberation present, MFCC performance is not reliable. To address this, we propose a new feature 'entrocy' for accurate and robust speaker recognition, which we mainly employ to support MFCC coefficients in noisy environments. Entrocy is the fourier transform of the entropy, a measure of the fluctuation of the information in sound segments over time. Entrocy features are combined with MFCCs to generate a composite feature set which is tested using the gaussian mixture model (GMM) speaker recognition method. The proposed method shows improved recognition accuracy over a range of signal-to-noise ratios.

Can voice similarity be assessed using an automatic speaker recognition system?

10.33774/coe-2021-dqwkx ◽

2021 ◽

Author(s):

Linda Gerlach ◽

Kirsty McDougall ◽

Finnian Kelly ◽

Anil Alexander

Keyword(s):

Speaker Recognition ◽

Recognition System ◽

Automatic Speaker Recognition

2021 5th International Conference on Trends in Electronics and Informatics (ICOEI) ◽

A Comprehensive Study on Automatic Speaker Recognition by using Deep Learning Techniques

10.1109/icoei51242.2021.9452885 ◽

2021 ◽

Author(s):

Venkata Subba Reddy Gade ◽

M. Sumathi

Keyword(s):

Deep Learning ◽

Speaker Recognition ◽

Learning Techniques ◽

Comprehensive Study

Speaker Recognition Systems in the Last Decade – A Survey

Engineering and Technology Journal ◽

10.30684/etj.v39i1b.1589 ◽

2021 ◽

Vol 39 (1B) ◽

pp. 30-40

Author(s):

Ahmed M. Ahmed ◽

Aliaa K. Hassan

Keyword(s):

Feature Extraction ◽

Speaker Recognition ◽

Clustering Algorithms ◽

Predictive Coding ◽

Gaussian Mixture ◽

Linear Predictive Coding ◽

Voice Signal ◽

Authentication System

Speaker Recognition Defined by the process of recognizing a person by his\her voice through specific features that extract from his\her voice signal. An Automatic Speaker recognition (ASP) is a biometric authentication system. In the last decade, many advances in the speaker recognition field have been attained, along with many techniques in feature extraction and modeling phases. In this paper, we present an overview of the most recent works in ASP technology. The study makes an effort to discuss several modeling ASP techniques like Gaussian Mixture Model GMM, Vector Quantization (VQ), and Clustering Algorithms. Also, several feature extraction techniques like Linear Predictive Coding (LPC) and Mel frequency cepstral coefficients (MFCC) are examined. Finally, as a result of this study, we found MFCC and GMM methods could be considered as the most successful techniques in the field of speaker recognition so far.

An improved MMSE estimator based modified group delay spectrum for Forensic Automatic Speaker Recognition

International Journal of Speech Technology ◽

10.1007/s10772-021-09829-9 ◽

2021 ◽

Author(s):

Salim Djeghiour ◽

Mhania Guerti

Keyword(s):

Speaker Recognition ◽

Group Delay ◽

Mmse Estimator ◽

Modified Group Delay

Towards an objective comparison of feature extraction techniques for automatic speaker recognition systems

Bulletin of Electrical Engineering and Informatics ◽

10.11591/eei.v10i1.1782 ◽

2021 ◽

Vol 10 (1) ◽

pp. 374-382

Author(s):

Ayoub Bouziane ◽

Jamal Kharroubi ◽

Arsalane Zarghili

Keyword(s):

Feature Extraction ◽

Speaker Recognition ◽

Comparative Studies ◽

Features Extraction ◽

Modeling Technique ◽

Extraction Techniques ◽

Recognition Systems ◽

Speaker Modeling

A common limitation of the previous comparative studies on speaker-features extraction techniques lies in the fact that the comparison is done independently of the used speaker modeling technique and its parameters. The aim of the present paper is twofold. Firstly, it aims to review the most significant advancements in feature extraction techniques used for automatic speaker recognition. Secondly, it seeks to evaluate and compare the currently dominant ones using an objective comparison methodology that overcomes the various limitations and drawbacks of the previous comparative studies. The results of the carried out experiments underlines the importance of the proposed comparison methodology.

Automatic Speaker Recognition using Transfer Learning Approach of Deep Learning Models

2021 6th International Conference on Inventive Computation Technologies (ICICT) ◽

10.1109/icict50816.2021.9358539 ◽

2021 ◽

Author(s):

Sonal Ganvir ◽

Nidhi Lal

Keyword(s):

Deep Learning ◽

Transfer Learning ◽

Speaker Recognition ◽

Learning Approach ◽

Learning Models ◽

Automatic Speaker Recognition