Speech Emotion Recognition Based on Mixed MFCC

Due to MFCC characteristic parameter in speech recognition has low identification accuracy when signal is intermediate, high frequency signal, this paper put forward a improved algorithm of combining MFCC, Mid-MFCC and IMFCC, using increase or decrease component method to calculate the contribution that MFCC, Mid-MFCC and IMFCC each order cepstrum component was used in speech emotion recognition, extracting several order cepstrum component with highest contribution from three characteristic parameters and forming a new characteristic parameter. The experiment results show that under the same environment new characteristic parameter has higher recognition rate than classic MFCC characteristic parameter in speech emotion recognition.

Download Full-text

A Mathematical Morphological Processing of Spectrograms for the Tone of Chinese Vowels Recognition

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.571-572.665 ◽

2014 ◽

Vol 571-572 ◽

pp. 665-671 ◽

Cited By ~ 1

Author(s):

Sen Xu ◽

Xu Zhao ◽

Cheng Hua Duan ◽

Xiao Lin Cao ◽

Hui Yan Li ◽

...

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Emotion Recognition ◽

Recognition Rate ◽

Morphological Processing ◽

Speech Emotion Recognition ◽

Normal Tone ◽

Tone Recognition ◽

Tone Signal ◽

The Neural Networks

As One of Features from other Languages, the Chinese Tone Changes of Chinese are Mainly Decided by its Vowels, so the Vowel Variation of Chinese Tone Becomes Important in Speech Recognition Research. the Normal Tone Recognition Ways are Always Based on Fundamental Frequency of Signal, which can Not Keep Integrity of Tone Signal. we Bring Forward to a Mathematical Morphological Processing of Spectrograms for the Tone of Chinese Vowels. Firstly, we will have Pretreatment to Recording Good Tone Signal by Using Cooledit Pro Software, and Converted into Spectrograms; Secondly, we will do Smooth and the Normalized Pretreatment to Spectrograms by Mathematical Morphological Processing; Finally, we get Whole Direction Angle Statistics of Tone Signal by Skeletonization way. the Neural Networks Stimulation Shows that the Speech Emotion Recognition Rate can Reach 92.50%.

Download Full-text

Characteristic Parameters Extraction and Pattern Recognition of Partial Discharges based on Envelope of Ultra-high Frequency Signal

Proceedings of the 2015 4th International Conference on Sensors, Measurement and Intelligent Materials ◽

10.2991/icsmim-15.2016.228 ◽

2016 ◽

Author(s):

Zhaoli Gao ◽

Yingtao Sun ◽

Lingen Luo ◽

Gehao Sheng ◽

Xiuchen Jiang

Keyword(s):

Pattern Recognition ◽

High Frequency ◽

Partial Discharges ◽

Characteristic Parameters ◽

Frequency Signal ◽

High Frequency Signal ◽

Ultra High Frequency ◽

Parameters Extraction

Download Full-text

Research on Speech Emotion Recognition Based on Weighted Euclidean Distance

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.543-547.2192 ◽

2014 ◽

Vol 543-547 ◽

pp. 2192-2195 ◽

Cited By ~ 1

Author(s):

Chen Chen Huang ◽

Wei Gong ◽

Wen Long Fu ◽

Dong Yu Feng

Keyword(s):

Emotion Recognition ◽

Template Matching ◽

Euclidean Distance ◽

Recognition Rate ◽

Speech Emotion Recognition ◽

Human Beings ◽

Analysis Method ◽

Characteristic Parameters ◽

Emotional Information ◽

Voice Data

As the most important medium of communication in human beings life, speech carries abundant emotional information. In recent years, how to recognize the speakers emotional state automatically from the speech is attracting extensive attention of researchers in various fields. In this paper, we studied the method of speech emotion recognition. We collected a total of 360 sentences from four speakers with the emotional statement about happiness, anger, surprise, sadness, and extracted eight emotional characteristics from these voice data. Contribution analysis method is proposed to determine the value of emotion characteristic parameters. We also have used the weighted Euclidean distance template matching to identify the speech emotion, got more than 80% of the average emotional recognition rate.

Download Full-text

Emotional Interactive Simulation System of English Speech Recognition in Virtual Context

Complexity ◽

10.1155/2020/9409630 ◽

2020 ◽

Vol 2020 ◽

pp. 1-11

Author(s):

Dan Li

Keyword(s):

Speech Recognition ◽

Emotion Recognition ◽

Hidden Markov ◽

Recognition Rate ◽

Simulation System ◽

Speech Emotion Recognition ◽

Chain Model ◽

Interactive Simulation ◽

Human Ear ◽

The Impact

With the development of virtual scenes, the degree of simulation and functions of virtual reality have been very complete, providing a new platform and perspective for teaching design. Firstly, the hidden Markov chain model is used to perform emotion recognition on English speech signals. English speech emotion recognition and speech semantic recognition are essentially the same. Hidden Markov style has been widely used in English speech semantic recognition. The experiments of feature extraction and pattern recognition of speech samples prove that Hidden Markovian has higher recognition rate and better recognition effect in speech emotion recognition. Secondly, combining the human pronunciation model and the hearing model, by analyzing the impact of the glottis feature on the human ear hearing-model feature, the research application of the English speech recognition emotion interactive simulation system uses the glottis feature to compensate the human ear, hearing feature is proposed by compensated English speech recognition, and emotion interaction simulation system is used in the English speech emotion experiment, which has obtained a high recognition rate and showed excellent performance.

Download Full-text

MTPA Control of Sensorless IPMSM Drive System Based on Virtual and Actual High-Frequency Signal Injection

IEEE Transactions on Transportation Electrification ◽

10.1109/tte.2020.3048582 ◽

2021 ◽

pp. 1-1

Author(s):

Jiahao Zhang ◽

Guohai Liu ◽

Qian Chen

Keyword(s):

High Frequency ◽

Drive System ◽

Frequency Signal ◽

High Frequency Signal ◽

Signal Injection

Download Full-text

A Study on the Extraction Method of Partial Discharge Features in Gas Insulated Switchgear Based on Ultra-High Frequency Signal Envelope

Journal of Physics Conference Series ◽

10.1088/1742-6596/1659/1/012061 ◽

2020 ◽

Vol 1659 ◽

pp. 012061

Author(s):

Xiaoxin Chen ◽

Lin Zhao ◽

Shaoan Wang ◽

Xiang Sun ◽

Ping Qian

Keyword(s):

High Frequency ◽

Extraction Method ◽

Partial Discharge ◽

Frequency Signal ◽

High Frequency Signal ◽

Ultra High Frequency ◽

Signal Envelope ◽

Gas Insulated Switchgear

Download Full-text

A Research of Speech Emotion Recognition Based on Deep Belief Network and SVM

Mathematical Problems in Engineering ◽

10.1155/2014/749604 ◽

2014 ◽

Vol 2014 ◽

pp. 1-7 ◽

Cited By ~ 21

Author(s):

Chenchen Huang ◽

Wei Gong ◽

Wenlong Fu ◽

Dongyu Feng

Keyword(s):

Feature Extraction ◽

Emotion Recognition ◽

Recognition Rate ◽

Original Method ◽

Speech Emotion Recognition ◽

High Dimensional ◽

Svm Classifier ◽

Multiple Classifier System ◽

Classifier System ◽

Multiple Classifier

Feature extraction is a very important part in speech emotion recognition, and in allusion to feature extraction in speech emotion recognition problems, this paper proposed a new method of feature extraction, using DBNs in DNN to extract emotional features in speech signal automatically. By training a 5 layers depth DBNs, to extract speech emotion feature and incorporate multiple consecutive frames to form a high dimensional feature. The features after training in DBNs were the input of nonlinear SVM classifier, and finally speech emotion recognition multiple classifier system was achieved. The speech emotion recognition rate of the system reached 86.5%, which was 7% higher than the original method.

Download Full-text

Dither in a rolling airframe flight vehicle with a two-position actuator: An amplitude distribution approach

Transactions of the Institute of Measurement and Control ◽

10.1177/0142331216631190 ◽

2016 ◽

Vol 39 (8) ◽

pp. 1205-1215 ◽

Cited By ~ 1

Author(s):

Bahram Mohammadi ◽

Mohammad Reza Arvan ◽

Yousof Koohmaskan

Keyword(s):

High Frequency ◽

Degrees Of Freedom ◽

Coupling Effect ◽

Cross Coupling ◽

Amplitude Distribution ◽

Feedback System ◽

Flight Vehicle ◽

Frequency Signal ◽

High Frequency Signal ◽

Minimal Error

Rolling airframe manoeuvring is a type of manoeuvre in which the missile provides continuous roll during flight. Cross-coupling between the angle of attack and sideslip in rolling airframe missiles (RAMs) yields a coning motion around the flight path. As the pitch and yaw cross-coupling effect decreases, the radius of this coning motion decreases and the accuracy of the control system increases. Two-position (on–off) actuators are used in most RAMs. The presence of a two-position actuator in a feedback system makes its characteristics non-linear. A high-frequency signal so-called dither is applied to compensate for the non-linearity effect of the actuator characteristic in the feedback system and to stabilize the coning motion. The amplitude distribution function (ADF) method in dither analysis shows that the smoothed non-linearity characteristic can be computed as the convolution of the original non-linearity and the ADF of the dither signal. According to the four-degrees-of-freedom (4-DOF) equations of RAMs in a non-rolling frame and regarding various dither signals through the ADF approach on a two-position actuator, an analytical condition for dither amplitude in coning motion stability of RAMs is derived. It was shown that the triangular signal with specified amplitude and high enough frequency led to a smoother response of two-position actuators. Finally, by applying beam-riding guidance to a RAM, the performance of dithers for decreasing the distance of the missile from the centre of the beam is validated through simulations. It is illustrated that applying the triangular dither resulted in minimal error.

Download Full-text