Speech Emotion Recognition Based on Mixed MFCC

2012 ◽  
Vol 249-250 ◽  
pp. 1252-1258 ◽  
Author(s):  
Ping Zhou ◽  
Xiao Pan Li ◽  
Jie Li ◽  
Xin Xing Jing

Due to MFCC characteristic parameter in speech recognition has low identification accuracy when signal is intermediate, high frequency signal, this paper put forward a improved algorithm of combining MFCC, Mid-MFCC and IMFCC, using increase or decrease component method to calculate the contribution that MFCC, Mid-MFCC and IMFCC each order cepstrum component was used in speech emotion recognition, extracting several order cepstrum component with highest contribution from three characteristic parameters and forming a new characteristic parameter. The experiment results show that under the same environment new characteristic parameter has higher recognition rate than classic MFCC characteristic parameter in speech emotion recognition.

2014 ◽  
Vol 571-572 ◽  
pp. 665-671 ◽  
Author(s):  
Sen Xu ◽  
Xu Zhao ◽  
Cheng Hua Duan ◽  
Xiao Lin Cao ◽  
Hui Yan Li ◽  
...  

As One of Features from other Languages, the Chinese Tone Changes of Chinese are Mainly Decided by its Vowels, so the Vowel Variation of Chinese Tone Becomes Important in Speech Recognition Research. the Normal Tone Recognition Ways are Always Based on Fundamental Frequency of Signal, which can Not Keep Integrity of Tone Signal. we Bring Forward to a Mathematical Morphological Processing of Spectrograms for the Tone of Chinese Vowels. Firstly, we will have Pretreatment to Recording Good Tone Signal by Using Cooledit Pro Software, and Converted into Spectrograms; Secondly, we will do Smooth and the Normalized Pretreatment to Spectrograms by Mathematical Morphological Processing; Finally, we get Whole Direction Angle Statistics of Tone Signal by Skeletonization way. the Neural Networks Stimulation Shows that the Speech Emotion Recognition Rate can Reach 92.50%.


2014 ◽  
Vol 543-547 ◽  
pp. 2192-2195 ◽  
Author(s):  
Chen Chen Huang ◽  
Wei Gong ◽  
Wen Long Fu ◽  
Dong Yu Feng

As the most important medium of communication in human beings life, speech carries abundant emotional information. In recent years, how to recognize the speakers emotional state automatically from the speech is attracting extensive attention of researchers in various fields. In this paper, we studied the method of speech emotion recognition. We collected a total of 360 sentences from four speakers with the emotional statement about happiness, anger, surprise, sadness, and extracted eight emotional characteristics from these voice data. Contribution analysis method is proposed to determine the value of emotion characteristic parameters. We also have used the weighted Euclidean distance template matching to identify the speech emotion, got more than 80% of the average emotional recognition rate.


Complexity ◽  
2020 ◽  
Vol 2020 ◽  
pp. 1-11
Author(s):  
Dan Li

With the development of virtual scenes, the degree of simulation and functions of virtual reality have been very complete, providing a new platform and perspective for teaching design. Firstly, the hidden Markov chain model is used to perform emotion recognition on English speech signals. English speech emotion recognition and speech semantic recognition are essentially the same. Hidden Markov style has been widely used in English speech semantic recognition. The experiments of feature extraction and pattern recognition of speech samples prove that Hidden Markovian has higher recognition rate and better recognition effect in speech emotion recognition. Secondly, combining the human pronunciation model and the hearing model, by analyzing the impact of the glottis feature on the human ear hearing-model feature, the research application of the English speech recognition emotion interactive simulation system uses the glottis feature to compensate the human ear, hearing feature is proposed by compensated English speech recognition, and emotion interaction simulation system is used in the English speech emotion experiment, which has obtained a high recognition rate and showed excellent performance.


2014 ◽  
Vol 2014 ◽  
pp. 1-7 ◽  
Author(s):  
Chenchen Huang ◽  
Wei Gong ◽  
Wenlong Fu ◽  
Dongyu Feng

Feature extraction is a very important part in speech emotion recognition, and in allusion to feature extraction in speech emotion recognition problems, this paper proposed a new method of feature extraction, using DBNs in DNN to extract emotional features in speech signal automatically. By training a 5 layers depth DBNs, to extract speech emotion feature and incorporate multiple consecutive frames to form a high dimensional feature. The features after training in DBNs were the input of nonlinear SVM classifier, and finally speech emotion recognition multiple classifier system was achieved. The speech emotion recognition rate of the system reached 86.5%, which was 7% higher than the original method.


2016 ◽  
Vol 39 (8) ◽  
pp. 1205-1215 ◽  
Author(s):  
Bahram Mohammadi ◽  
Mohammad Reza Arvan ◽  
Yousof Koohmaskan

Rolling airframe manoeuvring is a type of manoeuvre in which the missile provides continuous roll during flight. Cross-coupling between the angle of attack and sideslip in rolling airframe missiles (RAMs) yields a coning motion around the flight path. As the pitch and yaw cross-coupling effect decreases, the radius of this coning motion decreases and the accuracy of the control system increases. Two-position (on–off) actuators are used in most RAMs. The presence of a two-position actuator in a feedback system makes its characteristics non-linear. A high-frequency signal so-called dither is applied to compensate for the non-linearity effect of the actuator characteristic in the feedback system and to stabilize the coning motion. The amplitude distribution function (ADF) method in dither analysis shows that the smoothed non-linearity characteristic can be computed as the convolution of the original non-linearity and the ADF of the dither signal. According to the four-degrees-of-freedom (4-DOF) equations of RAMs in a non-rolling frame and regarding various dither signals through the ADF approach on a two-position actuator, an analytical condition for dither amplitude in coning motion stability of RAMs is derived. It was shown that the triangular signal with specified amplitude and high enough frequency led to a smoother response of two-position actuators. Finally, by applying beam-riding guidance to a RAM, the performance of dithers for decreasing the distance of the missile from the centre of the beam is validated through simulations. It is illustrated that applying the triangular dither resulted in minimal error.


Sign in / Sign up

Export Citation Format

Share Document