Speaker Recognition based on Mel-Frequency Cepstral Coefficients and Vector Quantization

In this paper, different feature extraction and feature normalization methods are investigated for speaker recognition. With a view to give a good representation of acoustic speech signals, Power Normalized Cepstral Coefficients (PNCCs) and Mel Frequency Cepstral Coefficients (MFCCs) are employed for feature extraction. Then, to mitigate the effect of linear channel, Cepstral Mean-Variance Normalization (CMVN) and feature warping are utilized. The current paper investigates Text-independent speaker identification system by using 16 coefficients from both the MFCCs and PNCCs features. Eight different speakers are selected from the GRID-Audiovisual database with two females and six males. The speakers are modeled using the coupling between the Universal Background Model and Gaussian Mixture Models (GMM-UBM) in order to get a fast scoring technique and better performance. The system shows 100% in terms of speaker identification accuracy. The results illustrated that PNCCs features have better performance compared to the MFCCs features to identify females compared to male speakers. Furthermore, feature wrapping reported better performance compared to the CMVN method.

Download Full-text

The Teager-Kaiser Energy Cepstral Coefficients as an Effective Structural Health Monitoring Tool

Applied Sciences ◽

10.3390/app9235064 ◽

2019 ◽

Vol 9 (23) ◽

pp. 5064 ◽

Cited By ~ 5

Author(s):

Marco Civera ◽

Matteo Ferraris ◽

Rosario Ceravolo ◽

Cecilia Surace ◽

Raimondo Betti

Keyword(s):

Experimental Data ◽

Structural Health Monitoring ◽

Health Monitoring ◽

Speech Processing ◽

Speaker Recognition ◽

Vibration Analysis ◽

Monitoring Tool ◽

Mel Frequency Cepstral Coefficients ◽

Structural Health ◽

Cepstral Coefficients

Recently, features and techniques from speech processing have started to gain increasing attention in the Structural Health Monitoring (SHM) community, in the context of vibration analysis. In particular, the Cepstral Coefficients (CCs) proved to be apt in discerning the response of a damaged structure with respect to a given undamaged baseline. Previous works relied on the Mel-Frequency Cepstral Coefficients (MFCCs). This approach, while efficient and still very common in applications, such as speech and speaker recognition, has been followed by other more advanced and competitive techniques for the same aims. The Teager-Kaiser Energy Cepstral Coefficients (TECCs) is one of these alternatives. These features are very closely related to MFCCs, but provide interesting and useful additional values, such as e.g., improved robustness with respect to noise. The goal of this paper is to introduce the use of TECCs for damage detection purposes, by highlighting their competitiveness with closely related features. Promising results from both numerical and experimental data were obtained.

Download Full-text

Text independent speaker recognition using the mel frequency cepstral coefficients and a neural network classifier

First International Symposium on Control, Communications and Signal Processing, 2004. ◽

10.1109/isccsp.2004.1296479 ◽

2004 ◽

Cited By ~ 10

Author(s):

H. Seddik ◽

A. Rahmouni ◽

M. Sayadi

Keyword(s):

Neural Network ◽

Speaker Recognition ◽

Neural Network Classifier ◽

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients

Download Full-text

Pengenalan Pembicara untuk Menentukan Gender Menggunakan Metode MFCC dan VQ

MIND Journal ◽

10.26760/mindjournal.v2i1.34-47 ◽

2018 ◽

Vol 2 (1) ◽

pp. 34-47

Author(s):

Youllia Indrawaty N ◽

Andri Ana ◽

Dita Permatasari

Keyword(s):

Vector Quantization ◽

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients

Klasifikasi suara berdasarkan gender dibuat dengan tujuan agar komputer mampu mengenali suara laki-laki dan perempuan. Dengan kemampuan komputer yang mampu membedakan suara laki-laki dan perempuan pada pengembangan selanjutnya akan memperkuat tingkat suatu sistem keamanan yang menggunakan password dengan suara. Penelitian ini mengenai pengenalan gender dari pengucap/ pembicara dengan ucapan bergantung teks dan bergantung pembicara, dalam proses pengenalan tersebut digunakan algoritma ekstraksi yang disebut Mel Frequency Cepstral Coefficients (MFCC) digunakan untuk ekstraksi ciri dari sinyal wicara sedangkan proses pengelompokan menggunakan metode Vector Quantization (VQ). Dalam tahap pengenalan, ukuran distorsi berdasarkan minimisasi jarak Euclidean digunakan untuk mencocokkan penutur uji dengan penutur dalam database. Database wicara menggunakan 20 penutur, terdiri dari 10 penutur pria dan 10 penutur wanita dengan tingkat akurasi pria mencapai 90% dan wanita 80%.

Download Full-text

Gammachirp Filter Banks Applied in Roust Speaker Recognition Based GMM-UBM Classifier

The International Arab Journal of Information Technology ◽

10.34028/iajit/17/2/4 ◽

2019 ◽

Vol 17 (2) ◽

pp. 170-177

Author(s):

Lei Deng ◽

Yong Gao

Keyword(s):

Feature Extraction ◽

Speaker Recognition ◽

Recognition Performance ◽

Recognition System ◽

Cube Root ◽

Mel Frequency Cepstral Coefficients ◽

Feature Extraction Algorithm ◽

Extraction Algorithm ◽

Auditory Feature ◽

Cepstral Coefficients

In this paper, authors propose an auditory feature extraction algorithm in order to improve the performance of the speaker recognition system in noisy environments. In this auditory feature extraction algorithm, the Gammachirp filter bank is adapted to simulate the auditory model of human cochlea. In addition, the following three techniques are applied: cube-root compression method, Relative Spectral Filtering Technique (RASTA), and Cepstral Mean and Variance Normalization algorithm (CMVN).Subsequently, based on the theory of Gaussian Mixes Model-Universal Background Model (GMM-UBM), the simulated experiment was conducted. The experimental results implied that speaker recognition systems with the new auditory feature has better robustness and recognition performance compared to Mel-Frequency Cepstral Coefficients(MFCC), Relative Spectral-Perceptual Linear Predictive (RASTA-PLP),Cochlear Filter Cepstral Coefficients (CFCC) and gammatone Frequency Cepstral Coefficeints (GFCC)

Download Full-text

MEL-FREQUENCY CEPSTRAL COEFFICIENTS FOR SPEAKER RECOGNITION : A REVIEW

International Journal of Advance Engineering and Research Development ◽

10.21090/ijaerd.0205157 ◽

2015 ◽

Vol 2 (05) ◽

Cited By ~ 1

Keyword(s):

Speaker Recognition ◽

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients

Download Full-text

SISTEM PENGENALAN PENUTUR DENGAN METODE MEL-FREQUENCY WRAPPING DAN KUANTISASI VEKTOR

MATICS ◽

10.18860/mat.v0i0.1564 ◽

2012 ◽

Author(s):

Ali Mustofa

Keyword(s):

Vector Quantization ◽

Filter Bank ◽

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients

Pengenalan penutur adalah proses identifikasi suara seseorang.. Pengenalan penutur berguna untuk otentikasi biometrik dan komunikasi antara komputer dengan manusia. Teknik Mel Frequency Cepstral Coefficients (MFCC) digunakan untuk ekstraksi ciri dari sinyal wicara dan membandingkan dengan penutur tak dikenal dengan penutur yang ada dalam database. Filter bank digunakan sebagai pembungkus (wrapping) mel frekuensi. Vector Quantization (VQ) adalah proses meletakkan vektor-vektor ciri yang besar dan menghasilkan ukuran vektor-vektor yang kecil yang berhubungan dengan distribusi centroid. Algoritma K-mean digunakan untuk kluster. Dalam tahap pengenalan, ukuran distorsi berdasarkan minimisasi jarak Euclidean digunakan untuk mencocokkan penutur tak dikenal dengan penutur dalam database. Database wicara menggunakan 10 penutur berbeda dengan MFCC 12, 20 codebook, dan 16 centroid. Kata kunci: penutur, Mel Frequency Cepstral Coefficients, Vector Quantization, K-mean

Download Full-text

Bandwidth expansion of speech based on vector quantization of the mel frequency cepstral coefficients

1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351) ◽

10.1109/scft.1999.781521 ◽

2003 ◽

Cited By ~ 20

Author(s):

N. Enbom ◽

W.B. Kleijn

Keyword(s):

Vector Quantization ◽

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients ◽

Bandwidth Expansion

Download Full-text