New feature vector extraction method for speaker recognition

Author(s):  
Lyudmila Sukhostat ◽  
Yadigar Imamverdiyev
Author(s):  
A. Nagesh

The feature vectors of a speaker identification (SID) system play a crucial role in the overall performance of the system. Many new feature vector extraction methods are based on MFCC, but the ultimate goal is to maximize the performance of the SID system. The objective of this paper is to derive a new set of feature vectors based on Gammatone Frequency Cepstral Coefficients (GFCC), using a Gaussian mixture model (GMM), for speaker identification. MFCC are the default feature vectors for speaker recognition, but they are not very robust in the presence of additive noise. In recent studies, GFCC features have shown very good robustness against noise and acoustic change. The main idea of GMM-based GFCC feature extraction is to improve overall speaker identification performance in low signal-to-noise ratio (SNR) conditions.
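As a rough illustration of how a GMM can be used to pick a speaker from a sequence of feature frames, the sketch below scores frames against one diagonal-covariance GMM per speaker and returns the speaker with the highest average log-likelihood. It is not the paper's method: the function names, the toy two-dimensional frames (standing in for GFCC vectors), and the hand-set model parameters are all assumptions made for the example.

```python
import math


def diag_gauss_logpdf(x, mean, var):
    """Log density of a Gaussian with diagonal covariance, evaluated at x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def gmm_loglik(frame, gmm):
    """Log-likelihood of one frame under a GMM given as (weight, mean, var) tuples."""
    terms = [math.log(w) + diag_gauss_logpdf(frame, m, v) for w, m, v in gmm]
    peak = max(terms)  # log-sum-exp for numerical stability
    return peak + math.log(sum(math.exp(t - peak) for t in terms))


def identify(frames, speaker_models):
    """Return the best-scoring speaker and the per-speaker average log-likelihoods."""
    scores = {
        name: sum(gmm_loglik(f, gmm) for f in frames) / len(frames)
        for name, gmm in speaker_models.items()
    }
    return max(scores, key=scores.get), scores


# Hypothetical, hand-set models; in practice each GMM would be trained
# (for example with EM) on that speaker's feature vectors.
models = {
    "speaker_a": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [1.0, 1.0], [1.0, 1.0])],
    "speaker_b": [(0.5, [5.0, 5.0], [1.0, 1.0]), (0.5, [6.0, 6.0], [1.0, 1.0])],
}
test_frames = [[0.2, -0.1], [0.9, 1.2], [0.4, 0.6]]  # toy stand-ins for GFCC frames
best, scores = identify(test_frames, models)
print(best)
```

Averaging the per-frame log-likelihood keeps scores comparable across utterances of different lengths, which is one common design choice rather than the only one.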


2021 ◽  
Vol 2021 ◽  
pp. 1-9
Author(s):  
Shi Wang ◽  
Zhujun Wang ◽  
Yi Jiang ◽  
Huayu Wang

In the event extraction task, a corpus may contain multiple scenarios, and an argument may play different roles under different triggers. The traditional tagging scheme can tag each word only once, so it cannot solve the problem of argument overlap. A hierarchical tagging pipeline model for Chinese corpora, based on the pretrained model BERT, is proposed; it obtains the relevant arguments of each event in a hierarchical way. The model uses a pipeline structure, and the event extraction task is divided into event trigger classification and argument recognition. First, the pretrained BERT model generates the feature vector and passes it to a bidirectional gated recurrent unit + conditional random field (BiGRU+CRF) model for trigger classification; then, the marked event type features are spliced into the corpus as known features and passed into BiGRU+CRF for argument recognition. We evaluated our method on DUEE, combined with data enhancement and mask operation. Experimental results show that our method improves on other baselines, which demonstrates the effectiveness of the model on Chinese corpora.
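To make the argument-overlap point concrete, the sketch below assumes the second stage emits one BIO tag sequence per detected event type, so the same token can receive a different role under each event. Only the decoding step is shown. The tokens (English stand-ins for a Chinese sentence), the event types, the role labels, and the tag sequences are all hypothetical, and no BERT or BiGRU+CRF model is involved.

```python
def decode_bio(tokens, tags):
    """Turn a BIO tag sequence into a list of (role, text) spans."""
    spans = []
    label, words = None, []
    for tok, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            if label is not None:
                spans.append((label, " ".join(words)))
            label, words = tag[2:], [tok]
        elif tag.startswith("I-") and label == tag[2:]:
            words.append(tok)
        else:
            # "O", or an I- tag that does not continue the open span
            if label is not None:
                spans.append((label, " ".join(words)))
            label, words = None, []
    if label is not None:
        spans.append((label, " ".join(words)))
    return spans


tokens = ["The", "bomber", "died", "in", "the", "attack"]

# One tag sequence per event type (hypothetical tagger output).
tags_by_event = {
    "Attack": ["O", "B-Attacker", "O", "O", "O", "O"],
    "Die": ["O", "B-Victim", "O", "O", "O", "O"],
}

results = {event: decode_bio(tokens, tags) for event, tags in tags_by_event.items()}
print(results)
```

Because each event type gets its own pass over the sentence, "bomber" can be an Attacker in one pass and a Victim in the other, which a single flat tag sequence cannot express.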


Author(s):  
Qian Liu ◽  
Feng Yang ◽  
XiaoFen Tang

To address the lack of a mechanism for enhancing the neighborhood relationship among HOG blocks, this paper proposes the neighborhood descriptor of oriented gradients (NDOG), an improved feature descriptor based on HOG, for pedestrian detection. To obtain the NDOG feature vector, the algorithm calculates the local weight vector of the HOG feature descriptor while integrating spatial correlation among blocks, concatenates this weight vector to the tail of the HOG feature descriptor, and uses the gradient norm to normalize the new feature vector. Using the proposed NDOG feature vector together with a linear SVM classifier, the paper develops a complete pedestrian detection approach. Experimental results on the INRIA, Caltech-USA, and ETH pedestrian datasets show that the approach achieves a lower miss rate and a higher average precision than HOG and other advanced pedestrian detection methods, especially when training samples are insufficient.
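The abstract does not define the local weight vector or the exact normalization, so the following is only a structural sketch of the "compute weights from neighbouring blocks, append them to the descriptor, then normalize" pattern. The weight used here (mean cosine similarity between a block and its 4-connected neighbours) and the L2 normalization are assumptions chosen for illustration, not the NDOG definition, and the block histograms are toy values.

```python
import math


def cosine(a, b):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na > 0 and nb > 0 else 0.0


def neighbour_weights(blocks):
    """One weight per block: mean similarity to its 4-connected neighbours (assumed)."""
    rows, cols = len(blocks), len(blocks[0])
    weights = []
    for r in range(rows):
        for c in range(cols):
            sims = [
                cosine(blocks[r][c], blocks[rr][cc])
                for rr, cc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1))
                if 0 <= rr < rows and 0 <= cc < cols
            ]
            weights.append(sum(sims) / len(sims) if sims else 0.0)
    return weights


def extended_descriptor(blocks):
    """Flatten block histograms, append the weight vector, then L2-normalize."""
    hog = [v for row in blocks for block in row for v in block]
    vec = hog + neighbour_weights(blocks)
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else vec


# Toy 2x2 grid of 2-bin block histograms (hypothetical values).
grid = [
    [[1.0, 0.0], [0.8, 0.2]],
    [[0.0, 1.0], [0.5, 0.5]],
]
descriptor = extended_descriptor(grid)
print(len(descriptor))
```

The resulting vector has the original histogram entries followed by one weight per block, and could be passed to a linear classifier in the same way as a plain HOG descriptor.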


Complexity ◽  
2020 ◽  
Vol 2020 ◽  
pp. 1-9
Author(s):  
Jiang Lin ◽  
Yi Yumei ◽  
Zhang Maosheng ◽  
Chen Defeng ◽  
Wang Chao ◽  
...  

In speaker recognition systems, feature extraction is a challenging task under environmental noise conditions. To improve the robustness of the features, we propose a multiscale chaotic feature for speaker recognition. We use a multiresolution analysis technique to capture finer information about different speakers in the frequency domain. We then extract the chaotic characteristics of speech based on a nonlinear dynamic model, which helps to improve the discriminative power of the features. Finally, we use a GMM-UBM model to develop a speaker recognition system. Our experimental results verify its good performance: under clean-speech and noisy-speech conditions, the ERR value of our method is reduced by 13.94% and 26.5%, respectively, compared with the state-of-the-art method.
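The abstract does not say which multiresolution analysis is used. As one simple, well-known instance, the sketch below performs a multilevel Haar wavelet decomposition, which splits a signal into a coarse approximation plus detail coefficients at successively coarser scales. The choice of the Haar wavelet, the function names, and the toy signal are assumptions for illustration only.

```python
import math


def haar_step(signal):
    """One level of the orthonormal Haar transform.

    Pairs of samples are combined into a scaled sum (approximation) and a
    scaled difference (detail). A trailing unpaired sample is dropped.
    """
    s = 1.0 / math.sqrt(2.0)
    pairs = range(0, len(signal) - 1, 2)
    approx = [(signal[i] + signal[i + 1]) * s for i in pairs]
    detail = [(signal[i] - signal[i + 1]) * s for i in pairs]
    return approx, detail


def haar_decompose(signal, levels):
    """Return the final approximation and a list of detail bands, finest first."""
    approx = list(signal)
    details = []
    for _ in range(levels):
        if len(approx) < 2:
            break
        approx, detail = haar_step(approx)
        details.append(detail)
    return approx, details


# Toy signal of length 8 (hypothetical samples).
x = [4.0, 6.0, 10.0, 12.0, 8.0, 6.0, 5.0, 5.0]
approx, details = haar_decompose(x, levels=3)
print([len(d) for d in details], len(approx))
```

Per-band statistics computed from the detail coefficients are one way to obtain features at several scales; how the paper combines scales with its chaotic features is not stated in the abstract.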


Author(s):  
Dewi Nasien ◽  
Deni Yulianti ◽  
Fakhrul Syakirin Omar ◽  
M. Hasmil Adiya ◽  
Yenny Desnelita ◽  
...  

2013 ◽  
Vol 2013 ◽  
pp. 1-8 ◽  
Author(s):  
Jianwei Yang ◽  
Guosheng Cheng ◽  
Ming Li

A fractal-based approach is presented for extracting affine invariant features. Central projection transformation is employed to reduce the dimensionality of the original input pattern, and the general contour (GC) of the pattern is derived. Affine invariant features cannot be extracted from the GC directly because of shearing. To address this problem, a group of curves (called shift curves) is constructed from the obtained GC. The fractal dimensions of these curves can readily be computed, and they constitute a new feature vector for the original pattern. The derived feature vector is then used for pattern recognition. Several experiments have been conducted to evaluate the performance of the proposed method. Experimental results show that the proposed method can be used for object classification.
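The abstract does not state which fractal dimension estimator is applied to the shift curves. One common choice is the box-counting dimension, sketched below for a curve given as a list of 2-D points: the points are scaled into the unit square, the occupied grid cells are counted at several resolutions, and the slope of log(count) against log(resolution) is the estimate. The function name, the default grid sizes, and the test curves are assumptions for illustration.

```python
import math


def box_counting_dimension(points, grid_sizes=(2, 4, 8, 16, 32)):
    """Estimate the box-counting dimension of a 2-D point set.

    grid_sizes lists the number of boxes per side at each resolution.
    """
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    x_min, y_min = min(xs), min(ys)
    span = max(max(xs) - x_min, max(ys) - y_min)
    if span == 0:
        raise ValueError("all points coincide; dimension is undefined here")

    log_k, log_n = [], []
    for k in grid_sizes:
        occupied = set()
        for x, y in points:
            # Map each point to a box index; clamp the upper edge into the last box.
            ix = min(int((x - x_min) / span * k), k - 1)
            iy = min(int((y - y_min) / span * k), k - 1)
            occupied.add((ix, iy))
        log_k.append(math.log(k))
        log_n.append(math.log(len(occupied)))

    # Least-squares slope of log(count) versus log(boxes per side).
    mean_k = sum(log_k) / len(log_k)
    mean_n = sum(log_n) / len(log_n)
    num = sum((a - mean_k) * (b - mean_n) for a, b in zip(log_k, log_n))
    den = sum((a - mean_k) ** 2 for a in log_k)
    return num / den


# Sanity checks on shapes with known dimension.
line = [(i / 4096, i / 4096) for i in range(4096)]
square = [(i / 64, j / 64) for i in range(64) for j in range(64)]
dim_line = box_counting_dimension(line)
dim_square = box_counting_dimension(square)
print(round(dim_line, 3), round(dim_square, 3))
```

Computing one such estimate per curve and stacking the values gives a fixed-length vector, which matches the abstract's description of the fractal dimensions forming the feature vector.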

