Gammachirp Filter Banks Applied in Robust Speaker Recognition Based GMM-UBM Classifier

2019 ◽  
Vol 17 (2) ◽  
pp. 170-177
Author(s):  
Lei Deng ◽  
Yong Gao

In this paper, the authors propose an auditory feature extraction algorithm to improve the performance of speaker recognition systems in noisy environments. In this algorithm, a Gammachirp filter bank is adapted to simulate the auditory model of the human cochlea. In addition, three techniques are applied: cube-root compression, Relative Spectral (RASTA) filtering, and Cepstral Mean and Variance Normalization (CMVN). Simulated experiments were then conducted using a Gaussian Mixture Model-Universal Background Model (GMM-UBM) classifier. The experimental results indicate that speaker recognition systems using the new auditory feature are more robust and achieve better recognition performance than those using Mel-Frequency Cepstral Coefficients (MFCC), Relative Spectral-Perceptual Linear Predictive (RASTA-PLP) features, Cochlear Filter Cepstral Coefficients (CFCC), and Gammatone Frequency Cepstral Coefficients (GFCC).
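
Two of the normalization steps named in this abstract can be sketched in a few lines. This is a minimal illustration only, not the authors' implementation: the function names `cube_root_compress` and `cmvn`, the list-of-frames layout, and the `eps` guard are assumptions made for the example, and the abstract does not say at which stage of the pipeline each step is applied.

```python
from statistics import fmean, pstdev


def cube_root_compress(energies):
    """Apply cube-root compression to non-negative filter-bank energies.

    `energies` is a list of frames; each frame is a list of channel energies.
    """
    return [[e ** (1.0 / 3.0) for e in frame] for frame in energies]


def cmvn(features, eps=1e-10):
    """Cepstral mean and variance normalization over one utterance.

    Each coefficient (column) is shifted to zero mean and scaled to unit
    standard deviation. `eps` (an assumption) guards against division by zero.
    """
    n_coeffs = len(features[0])
    columns = [[frame[k] for frame in features] for k in range(n_coeffs)]
    means = [fmean(col) for col in columns]
    stds = [pstdev(col) for col in columns]
    return [
        [(frame[k] - means[k]) / (stds[k] + eps) for k in range(n_coeffs)]
        for frame in features
    ]
```

Normalizing per utterance removes a constant offset in each coefficient, which is how a fixed linear channel shows up in the cepstral domain.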

Author(s):  
Musab T. S. Al-Kaltakchi ◽  
Haithem Abd Al-Raheem Taha ◽  
Mohanad Abd Shehab ◽  
Mohamed A.M. Abdullah

In this paper, different feature extraction and feature normalization methods are investigated for speaker recognition. To obtain a good representation of acoustic speech signals, Power Normalized Cepstral Coefficients (PNCCs) and Mel Frequency Cepstral Coefficients (MFCCs) are employed for feature extraction. Then, to mitigate linear channel effects, Cepstral Mean-Variance Normalization (CMVN) and feature warping are utilized. The paper investigates a text-independent speaker identification system using 16 coefficients from both the MFCC and PNCC features. Eight speakers (two female and six male) are selected from the GRID audiovisual database. The speakers are modeled by coupling Gaussian Mixture Models with a Universal Background Model (GMM-UBM) to obtain a fast scoring technique and better performance. The system reaches a speaker identification accuracy of 100%. The results show that PNCC features perform better than MFCC features at identifying female speakers compared with male speakers. Furthermore, feature warping performed better than CMVN.
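
GMM-UBM scoring is commonly expressed as an average log-likelihood ratio between a speaker model and the background model. The sketch below illustrates that idea with diagonal-covariance mixtures. It is not the system described in the abstract: the function names, the `(weights, means, variances)` tuple layout, and the toy parameters in the usage note are all assumptions for the example.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at vector x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(x, weights, means, variances):
    """Log-likelihood of one frame under a GMM (log-sum-exp for stability)."""
    logs = [
        math.log(w) + log_gaussian_diag(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    peak = max(logs)
    return peak + math.log(sum(math.exp(value - peak) for value in logs))


def llr_score(frames, speaker, ubm):
    """Average per-frame log-likelihood ratio: speaker model versus UBM.

    `speaker` and `ubm` are (weights, means, variances) tuples.
    """
    total = sum(
        gmm_log_likelihood(x, *speaker) - gmm_log_likelihood(x, *ubm)
        for x in frames
    )
    return total / len(frames)
```

With a toy one-dimensional UBM `([0.5, 0.5], [[0.0], [4.0]], [[1.0], [1.0]])` and a speaker model whose first mean is moved to `1.0`, frames near `1.0` score above zero and frames near `-1.0` score below zero. For identification, the enrolled speaker with the highest score would be selected.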


Robotica ◽  
1992 ◽  
Vol 10 (3) ◽  
pp. 241-254
Author(s):  
M. Mehdian

SUMMARY: A binary tactile image feature extraction algorithm using image primitive notation and perceptrons is presented. The basic image segments are defined as geometric factors by which the image structure is described so that effective feature values such as image shape, image size, perimeter and texture may be extracted on the basis of local image computation. The local property of the tactile image computation is evaluated by the concept called order of the perceptrons, and based on this feature extraction algorithm, an efficient tactile image recognition system is realised.


2014 ◽  
Vol 519-520 ◽  
pp. 577-580
Author(s):  
Shuai Yuan ◽  
Guo Yun Zhang ◽  
Jian Hui Wu ◽  
Long Yuan Guo

Fingerprint image feature extraction is a critical step in a fingerprint recognition system; it concerns the topological structure, mathematical model, and extraction algorithm of fingerprint features. This paper presents the system design and realization of a feature extraction algorithm for fingerprint images. First, feature points, including ending points, bifurcation points, and singular points, are extracted from the fingerprint skeleton image. Then, false feature points are detected and eliminated based on abrupt changes in the surrounding orientation field. Finally, the true feature points are marked. Test results show that the presented method offers good accuracy, high speed, and strong robustness for real-time applications.
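
The abstract does not state how ending and bifurcation points are located on the skeleton image. One widely used technique is the crossing number, sketched below as an assumption rather than as the authors' method. The names `crossing_number` and `find_minutiae` are hypothetical, the skeleton is assumed to be a one-pixel-wide binary grid (1 = ridge), and neither singular-point detection nor false-point removal is covered.

```python
# The 8 neighbours of a pixel, listed in circular order starting from north.
NEIGHBOUR_OFFSETS = [(-1, 0), (-1, 1), (0, 1), (1, 1),
                     (1, 0), (1, -1), (0, -1), (-1, -1)]


def crossing_number(skeleton, row, col):
    """Half the number of 0/1 transitions around the 8-neighbour ring."""
    ring = [skeleton[row + dr][col + dc] for dr, dc in NEIGHBOUR_OFFSETS]
    return sum(abs(ring[i] - ring[(i + 1) % 8]) for i in range(8)) // 2


def find_minutiae(skeleton):
    """Return (endings, bifurcations) as lists of (row, col) coordinates.

    Border pixels are skipped because their neighbour ring is incomplete.
    """
    endings, bifurcations = [], []
    for row in range(1, len(skeleton) - 1):
        for col in range(1, len(skeleton[0]) - 1):
            if skeleton[row][col] != 1:
                continue
            cn = crossing_number(skeleton, row, col)
            if cn == 1:        # ridge stops here
                endings.append((row, col))
            elif cn == 3:      # ridge splits into two branches
                bifurcations.append((row, col))
    return endings, bifurcations
```

On a Y-shaped skeleton, the junction pixel has crossing number 3 and each of the three tips has crossing number 1, while pixels along a ridge have crossing number 2 and are ignored.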


2020 ◽  
Vol 10 (2) ◽  
pp. 5547-5553
Author(s):  
A. A. Alasadi ◽  
T. H. Aldhayni ◽  
R. R. Deshmukh ◽  
A. H. Alahmadi ◽  
A. S. Alshebami

This paper studies three feature extraction methods, Mel-Frequency Cepstral Coefficients (MFCC), Power-Normalized Cepstral Coefficients (PNCC), and Modified Group Delay Function (ModGDF), for the development of an Automated Speech Recognition (ASR) system for Arabic. The obtained features were processed by a Support Vector Machine (SVM) algorithm. These feature extraction algorithms extract speech or voice characteristics and process the group delay function calculated directly from the voice signal. The algorithms were deployed to extract audio features from Arabic speakers. PNCC provided the best recognition results for Arabic speech among the three methods. Simulation results showed that PNCC and ModGDF were both more accurate than MFCC in Arabic speech recognition.

