Speaker identification with vector quantization and k-harmonic means

Author(s):  
Mustafa Yazici ◽  
Mustafa Ulutas
2018 ◽  
Author(s):  
Purwono Prasetyawan

Biometrics is a technology for physical analysis and human behavior used in authentication. One of the behavioral characteristics associated with a person is sound. A person's voice can be identified based on the person's voice signal characteristics. There are several methods in recognizing speaker sound, such as with Mel Frequency Cepstrum Calculation (MFCC) and Subband Based Cepstral (SBC). This study looked for the effectiveness of the use of MFCC and SBC feature extraction with LBG Vector Quantization matching characteristics. Effective feature extraction methods will be tested for realtime speaker identification. The results obtained from this research is the value of MFCC 32 coefficient more effective than the SBC accurately and the speed of the process of identification of both text-dependent and text-independent speaker. The test results of speaker identification in realtime using MFCC is still not satisfactory because the accuracy of recognition is still below 70%.


2020 ◽  
Vol 17 (3(Suppl.)) ◽  
pp. 1019
Author(s):  
Bassel Alkhatib ◽  
Mohammad Madian Waleed Kamal Eddin

The speaker identification is one of the fundamental problems in speech processing and voice modeling. The speaker identification applications include authentication in critical security systems and the accuracy of the selection. Large-scale voice recognition applications are a major challenge. Quick search in the speaker database requires fast, modern techniques and relies on artificial intelligence to achieve the desired results from the system. Many efforts are made to achieve this through the establishment of variable-based systems and the development of new methodologies for speaker identification. Speaker identification is the process of recognizing who is speaking using the characteristics extracted from the speech's waves like pitch, tone, and frequency. The speaker's models are created and saved in the system environment and used to verify the identity required by people accessing the systems, which allows access to various services that are controlled by voice, speaker identification involves two main parts: the first part is the feature extraction and the second part is the feature matching.


In this paper, the systems of speaker identification of a text-dependent and independent nature were considered. Feature extraction was performed using chalk-frequency cepstral coefficients (MFCC). The vector quantization method for the automatic identification of a person by voice has been investigated. Using the extracted features, the code book from each speaker was built by clustering the feature vectors. Speakers were modeled using vector quantization (VQ). Using the extracted features, the code book from each speaker was built by clustering the feature vectors. Codebooks of all announcers were collected in the database. From the results, it can be said that vector quantization using cepstral features produces good results for creating a voice recognition system.


2004 ◽  
Author(s):  
Mohammed Abu El-Yazeed ◽  
Nemat Abdel Kader ◽  
Mohammed El-Henawy

Sign in / Sign up

Export Citation Format

Share Document