A speech emotion recognition method in cross-languages corpus based on feature adaptation

A new recognition method based on Gaussian mixture model for speech emotion recognition is proposed in this paper. To improve the effectiveness of feature extraction and accuracy of emotion recognition, extraction of Mel frequency cepstrum coefficient combined with Gaussian mixture model is used to recognize speech emotion. According to feature parameters extraction method by analyzing the principle of vocalization theory, emotion models based on Gaussian mixture model are generated and the similarity of their templates is obtained. A series of experiments is performed with recorded speech based on Gaussian mixture model and indicates the system gains high performance and better robustness.

Download Full-text

Speech Emotion Recognition method using time-stretching in the Preprocessing Phase and Artificial Neural Network Classifiers

2020 IEEE 16th International Conference on Intelligent Computer Communication and Processing (ICCP) ◽

10.1109/iccp51029.2020.9266265 ◽

2020 ◽

Author(s):

Valentin Catalin Govoreanu ◽

Mihai Neghina

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Emotion Recognition ◽

Speech Emotion Recognition ◽

Recognition Method ◽

Artificial Neural ◽

Neural Network Classifiers ◽

Time Stretching

Download Full-text

Speech emotion recognition method based on hidden factor analysis

Electronics Letters ◽

10.1049/el.2014.3339 ◽

2015 ◽

Vol 51 (1) ◽

pp. 112-114 ◽

Cited By ~ 11

Author(s):

Peng Song ◽

Yun Jin ◽

Cheng Zha ◽

Li Zhao

Keyword(s):

Factor Analysis ◽

Emotion Recognition ◽

Speech Emotion Recognition ◽

Recognition Method

Download Full-text

Improving Speech Emotion Recognition Method of Convolutional Neural Network

International Journal of Recent Engineering Science ◽

10.14445/23497157/ijres-v5i3p101 ◽

2018 ◽

Vol 5 (3) ◽

pp. 1-7

Author(s):

ZENG Runhua ◽

ZHANG Shuqun

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Emotion Recognition ◽

Speech Emotion Recognition ◽

Recognition Method

Download Full-text

Arabic Speech Emotion Recognition Method Based On LPC And PPSD

2021 2nd International Conference on Computation, Automation and Knowledge Management (ICCAKM) ◽

10.1109/iccakm50778.2021.9357769 ◽

2021 ◽

Author(s):

Omar Ahmad Mohammad ◽

Mourad Elhadef

Keyword(s):

Emotion Recognition ◽

Speech Emotion Recognition ◽

Recognition Method

Download Full-text

Research on the Construction of Human-Computer Interaction System Based on a Machine Learning Algorithm

Journal of Sensors ◽

10.1155/2022/3817226 ◽

2022 ◽

Vol 2022 ◽

pp. 1-11

Author(s):

Yu Wang

Keyword(s):

Machine Learning ◽

Feature Extraction ◽

Human Computer Interaction ◽

Emotion Recognition ◽

Feature Matching ◽

Speech Emotion Recognition ◽

Dialogue System ◽

Recognition Method ◽

Dynamic Planning ◽

Planning Algorithm

In this paper, we use machine learning algorithms to conduct in-depth research and analysis on the construction of human-computer interaction systems and propose a simple and effective method for extracting salient features based on contextual information. The method can retain the dynamic and static information of gestures intact, which results in a richer and more robust feature representation. Secondly, this paper proposes a dynamic planning algorithm based on feature matching, which uses the consistency and accuracy of feature matching to measure the similarity of two frames and then uses a dynamic planning algorithm to find the optimal matching distance between two gesture sequences. The algorithm ensures the continuity and accuracy of the gesture description and makes full use of the spatiotemporal location information of the features. The features and limitations of common motion target detection methods in motion gesture detection and common machine learning tracking methods in gesture tracking are first analyzed, and then, the kernel correlation filter method is improved by designing a confidence model and introducing a scale filter, and finally, comparison experiments are conducted on a self-built gesture dataset to verify the effectiveness of the improved method. During the training and validation of the model by the corpus, the complementary feature extraction methods are ablated and learned, and the corresponding results obtained are compared with the three baseline methods. But due to this feature, GMMs are not suitable when users want to model the time structure. It has been widely used in classification tasks. By using the kernel function, the support vector machine can transform the original input set into a high-dimensional feature space. After experiments, the speech emotion recognition method proposed in this paper outperforms the baseline methods, proving the effectiveness of complementary feature extraction and the superiority of the deep learning model. The speech is used as the input of the system, and the emotion recognition is performed on the input speech, and the corresponding emotion obtained is successfully applied to the human-computer dialogue system in combination with the online speech recognition method, which proves that the speech emotion recognition applied to the human-computer dialogue system has application research value.

Download Full-text