Segmental Analysis of Speech Signal for Robust Speaker Recognition System

Voice is one of the parameters used to identify a person. A voice can reveal information such as the gender, age, and even the identity of the speaker. Speaker recognition is a method for curbing crimes and frauds committed by voice, and so reduces the faking of identities. The Mel Frequency Cepstrum Coefficient (MFCC) method can be used in the speech recognition system: extracting features from the speech signal with MFCC yields the acoustic features of that signal. For classification, Hidden Markov Models (HMM) are used to match an unidentified speaker's voice against the voices in the database. In this research, the system is used to verify the speaker, namely with 15 text-dependent entries in Indonesian. When the test speakers are the same as those in the database, the highest accuracy is 99.16%.
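The MFCC feature extraction mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the 8 kHz sample rate, the 256-sample frame, the 20 mel filters, and the 13 coefficients are assumptions chosen for illustration, and all function names are hypothetical. The sketch follows the usual steps of windowing, power spectrum, triangular mel filterbank, logarithm, and DCT-II for a single frame.

```python
import math

def hz_to_mel(f):
    # Common mel-scale formula (one of several conventions in use).
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def power_spectrum(frame):
    """Hamming-window the frame and return |DFT|^2 / N for bins 0..N/2 (naive DFT)."""
    n = len(frame)
    win = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
           for i, x in enumerate(frame)]
    spec = []
    for k in range(n // 2 + 1):
        re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(win))
        im = -sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(win))
        spec.append((re * re + im * im) / n)
    return spec

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters whose edges are equally spaced on the mel scale."""
    top = hz_to_mel(sample_rate / 2.0)
    edges_hz = [mel_to_hz(top * i / (n_filters + 1)) for i in range(n_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * f / sample_rate)) for f in edges_hz]
    bank = []
    for m in range(1, n_filters + 1):
        lo, mid, hi = bins[m - 1], bins[m], bins[m + 1]
        filt = [0.0] * (n_fft // 2 + 1)
        for k in range(lo, mid):      # rising slope
            filt[k] = (k - lo) / (mid - lo)
        for k in range(mid, hi):      # falling slope
            filt[k] = (hi - k) / (hi - mid)
        bank.append(filt)
    return bank

def mfcc(frame, sample_rate, n_filters=20, n_coeffs=13):
    """MFCCs of one frame; n_filters and n_coeffs are illustrative defaults."""
    spec = power_spectrum(frame)
    bank = mel_filterbank(n_filters, len(frame), sample_rate)
    # Floor the energies so the logarithm is always defined.
    log_e = [math.log(max(sum(w * p for w, p in zip(f, spec)), 1e-10)) for f in bank]
    # Unnormalized DCT-II of the log filterbank energies.
    return [sum(e * math.cos(math.pi * c * (j + 0.5) / n_filters)
                for j, e in enumerate(log_e))
            for c in range(n_coeffs)]

# Synthetic test frame: a 440 Hz tone sampled at an assumed 8 kHz.
frame = [math.sin(2 * math.pi * 440 * t / 8000) for t in range(256)]
coeffs = mfcc(frame, 8000)
print(len(coeffs))
```

In a full system, one such vector would be computed per overlapping frame, and the resulting sequence would be passed to the classifier (HMMs in this abstract). A production system would normally use an FFT rather than the naive DFT shown here.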

Robust speaker recognition system employing covariance matrix and Eigenvoice

2013 IEEE 56th International Midwest Symposium on Circuits and Systems (MWSCAS) ◽

10.1109/mwscas.2013.6674848 ◽

2013 ◽

Author(s):

Genevieve I. Sapijaszko ◽

Wasfy B. Mikhael

Keyword(s):

Covariance Matrix ◽

Speaker Recognition ◽

Recognition System ◽

Robust Speaker Recognition

Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition

Computer Speech & Language ◽

10.1016/j.csl.2019.06.004 ◽

2019 ◽

Vol 58 ◽

pp. 403-421 ◽

Cited By ~ 5

Author(s):

Ondřej Novotný ◽

Oldřich Plchot ◽

Ondřej Glembek ◽

Jan “Honza” Černocký ◽

Lukáš Burget

Keyword(s):

Speaker Recognition ◽

Speech Signal ◽

Signal Enhancement ◽

Robust Speaker Recognition

Performance of speaker recognition system using shifted MFCC, delta spectral cepstral coefficient (DSCC) and Fuzzy techniques

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i2.8.10424 ◽

2018 ◽

Vol 7 (2.8) ◽

pp. 278

Author(s):

Priyanka Bansal ◽

Syed Akhtar Imam

Keyword(s):

Real World ◽

Speaker Recognition ◽

Speech Signal ◽

Background Noise ◽

Recognition System ◽

Fuzzy Modeling ◽

Crosstalk Noise ◽

Noise Interference ◽

Automatic Speaker Recognition ◽

Fuzzy Techniques

Speech and speaker recognition systems are biometric-inspired systems with scope in various online and offline applications. In biometrics, we must consider the variability of the speech signal caused by noise, which greatly degrades the efficiency of Automatic Speaker Recognition (ASR) in real-world conditions. Real-world speech is degraded by different types of noise, such as background noise, interference noise, and crosstalk noise. In this paper, we use Delta Spectral Cepstral Coefficients (DSCC) and Shifted MFCC with fuzzy modeling techniques to improve the performance of ASR even in noisy surroundings, with the help of the additional speech information present at high frequencies in the spectral domain. The combination of fuzzy modeling and DSCC creates a firm cumulative algorithm with reasonably high robustness to noise. Experimental results show that accuracy improves by 10-20% even at 5-8 dB SNR in the presence of background noise, turbulent environmental conditions, or white noise. Thus the proposed model offers an improved maturity level in comparison to older methods.
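To make the "5-8 dB SNR" test condition concrete, here is a minimal sketch of how a clean signal can be mixed with white noise at a chosen signal-to-noise ratio. It is not taken from the paper: the function names, the 200 Hz test tone, the 8 kHz sample rate, and the fixed random seed are all assumptions for illustration. The sketch relies only on the standard definition SNR(dB) = 10 log10(P_signal / P_noise).

```python
import math
import random

def power(signal):
    """Mean squared amplitude of a sequence of samples."""
    return sum(x * x for x in signal) / len(signal)

def add_white_noise(signal, snr_db, seed=0):
    """Return signal plus Gaussian white noise scaled to the requested SNR in dB."""
    rng = random.Random(seed)  # fixed seed so the example is reproducible
    noise = [rng.gauss(0.0, 1.0) for _ in signal]
    # Noise power needed so that 10*log10(P_signal / P_noise) == snr_db.
    target_noise_power = power(signal) / (10.0 ** (snr_db / 10.0))
    scale = math.sqrt(target_noise_power / power(noise))
    return [s + scale * n for s, n in zip(signal, noise)]

def measured_snr_db(clean, noisy):
    """SNR in dB, treating the difference noisy - clean as the noise."""
    residual = [y - x for x, y in zip(clean, noisy)]
    return 10.0 * math.log10(power(clean) / power(residual))

# Hypothetical test signal: one second of a 200 Hz tone at 8 kHz.
clean = [math.sin(2 * math.pi * 200 * t / 8000) for t in range(8000)]
noisy = add_white_noise(clean, snr_db=5.0)
print(round(measured_snr_db(clean, noisy), 2))
```

At 5 dB the noise power is roughly a third of the signal power, which is why this range is treated as a demanding condition for recognition.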

Binaural Classification-Based Speech Segregation and Robust Speaker Recognition System

Circuits Systems and Signal Processing ◽

10.1007/s00034-017-0712-5 ◽

2017 ◽

Vol 37 (8) ◽

pp. 3383-3411 ◽

Cited By ~ 3

Author(s):

R. Venkatesan ◽

A. Balaji Ganesh

Keyword(s):

Speaker Recognition ◽

Recognition System ◽

Speech Segregation ◽

Robust Speaker Recognition

A robust speaker recognition system combining factor analysis techniques

2014 21th Iranian Conference on Biomedical Engineering (ICBME) ◽

10.1109/icbme.2014.7043948 ◽

2014 ◽

Author(s):

Shaghayegh Reza ◽

Tahereh Emami Azadi ◽

Jahanshah Kabudian ◽

Yaser Shekofteh

Keyword(s):

Factor Analysis ◽

Speaker Recognition ◽

Recognition System ◽

Analysis Techniques ◽

Robust Speaker Recognition

Multitaper Based MFCC Feature Extraction for Robust Speaker Recognition System

2019 Innovations in Power and Advanced Computing Technologies (i-PACT) ◽

10.1109/i-pact44901.2019.8960206 ◽

2019 ◽

Cited By ~ 2

Author(s):

K.P. Bharath ◽

Rajesh Kumar M.

Keyword(s):

Feature Extraction ◽

Speaker Recognition ◽

Recognition System ◽

Robust Speaker Recognition

Cost-Sensitive Learning for Emotion Robust Speaker Recognition

The Scientific World JOURNAL ◽

10.1155/2014/628516 ◽

2014 ◽

Vol 2014 ◽

pp. 1-9 ◽

Cited By ~ 6

Author(s):

Dongdong Li ◽

Yingchun Yang ◽

Weihui Dai

Keyword(s):

Speaker Recognition ◽

Recognition System ◽

Learning Technology ◽

Speech Corpus ◽

Voice Communication ◽

Cost Sensitive Learning ◽

Identification Rate ◽

Telephone System ◽

Robust Speaker Recognition ◽

Voice Data

In the field of information security, voice is one of the most important parts of biometrics. In particular, with the development of voice communication over the Internet and telephone systems, huge voice data resources have become accessible. In speaker recognition, a voiceprint can serve as the unique password with which users prove their identity. However, speech with various emotions can cause an unacceptably high error rate and degrade the performance of a speaker recognition system. This paper addresses the problem by introducing a cost-sensitive learning technology that reweights the probability of affective test utterances at the pitch envelope level, which effectively enhances robustness in emotion-dependent speaker recognition. Based on that technology, a new recognition system architecture and its components are proposed. The experiment conducted on the Mandarin Affective Speech Corpus shows an 8% improvement in identification rate over traditional speaker recognition.
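The abstract does not give the reweighting formula, so the following is only a generic sketch of the idea of reweighting probabilities before a speaker decision, not the paper's method. It assumes each candidate speaker model has already produced per-frame log-likelihoods, and that some separate step (not shown) assigns lower weights to frames judged to be strongly affected by emotion. The speaker names, likelihood values, and weights are all made up for illustration.

```python
def weighted_score(frame_log_likelihoods, weights):
    """Weighted average of per-frame log-likelihoods for one speaker model."""
    total_weight = sum(weights)
    return sum(w * ll for w, ll in zip(weights, frame_log_likelihoods)) / total_weight

def identify(scores_by_speaker, weights):
    """Return the speaker whose model has the highest weighted score."""
    return max(scores_by_speaker,
               key=lambda spk: weighted_score(scores_by_speaker[spk], weights))

# Hypothetical per-frame log-likelihoods; frames 3 and 4 stand in for
# emotionally coloured speech that fits the true speaker's model poorly.
scores = {
    "alice": [-1.0, -1.2, -6.0, -6.5],
    "bob":   [-2.0, -2.1, -3.0, -3.2],
}

uniform = [1.0, 1.0, 1.0, 1.0]      # conventional scoring
reweighted = [1.0, 1.0, 0.2, 0.2]   # assumed down-weighting of affective frames

print(identify(scores, uniform), identify(scores, reweighted))
```

With uniform weights the two affective frames dominate and the decision goes to the wrong model; down-weighting them lets the frames that match well decide. How the weights are actually derived from the pitch envelope is specific to the paper and is not reproduced here.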
