Performance Improvement of Speech Emotion Recognition Model Using Generative Adversarial Networks

2019 ◽  
Vol 17 (11) ◽  
pp. 77-85
Author(s):  
You-Jung Ko ◽  
Yoon-Joong Kim

Author(s):
Arash Shilandari ◽  
Hossein Marvi ◽  
Hossein Khosravi

With the increasing mechanization of daily life, speech processing has become crucial for interaction between humans and machines. Deep neural networks require a database with enough data for training, and the more features are extracted from the speech signal, the more samples are needed to train these networks. Adequate training can be ensured only when sufficient and varied data are available in each class; when they are not, data augmentation methods can be used to obtain a database with enough samples. One of the obstacles to developing speech emotion recognition systems is this data sparsity within each class. The present study focuses on a cycle generative adversarial network for data augmentation in a speech emotion recognition system. For each of the five emotions employed, a generative adversarial network is designed to produce data very similar to the real data of that class while remaining distinguishable from the other classes. These networks are trained adversarially to produce feature vectors resembling each class in the original feature space, and the generated vectors are then added to the existing training sets to train the classifier network. Instead of the common cross-entropy loss, Wasserstein divergence is used to train the generative adversarial networks, avoiding the vanishing gradient problem and producing high-quality artificial samples. The proposed network is evaluated on speech emotion recognition using the EMO-DB corpus for training, testing, and evaluation, with the quality of the artificial data assessed by two classifiers: a Support Vector Machine (SVM) and a Deep Neural Network (DNN). The results show that, by extracting and reproducing high-level representations from acoustic features, the five primary emotions can be recognized with acceptable accuracy.
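The training objective described in the abstract, a GAN over acoustic feature vectors with a Wasserstein-divergence (WGAN-div) loss in place of cross-entropy, can be sketched as follows. This is a minimal illustration, not the authors' code: the feature dimension, network sizes, penalty coefficients k and p, and optimizer settings are all illustrative assumptions.

import torch
import torch.nn as nn

FEAT_DIM = 128   # assumed dimension of the per-utterance acoustic feature vector
NOISE_DIM = 64   # assumed latent (noise) dimension
K, P = 2.0, 6.0  # WGAN-div penalty coefficients; common defaults, assumed here

# One generator/critic pair per emotion class (only one pair shown).
generator = nn.Sequential(
    nn.Linear(NOISE_DIM, 256), nn.ReLU(),
    nn.Linear(256, FEAT_DIM),
)
critic = nn.Sequential(
    nn.Linear(FEAT_DIM, 256), nn.LeakyReLU(0.2),
    nn.Linear(256, 1),  # unbounded score, not a probability
)

opt_g = torch.optim.Adam(generator.parameters(), lr=1e-4)
opt_c = torch.optim.Adam(critic.parameters(), lr=1e-4)

def critic_step(real_feats):
    """One critic update with the Wasserstein-divergence gradient penalty."""
    noise = torch.randn(real_feats.size(0), NOISE_DIM)
    fake_feats = generator(noise).detach()

    # Score real and generated feature vectors.
    real_score = critic(real_feats).mean()
    fake_score = critic(fake_feats).mean()

    # Gradient-norm penalty on interpolates between real and fake samples;
    # this term replaces weight clipping and keeps gradients informative.
    eps = torch.rand(real_feats.size(0), 1)
    interp = (eps * real_feats + (1 - eps) * fake_feats).requires_grad_(True)
    grad = torch.autograd.grad(critic(interp).sum(), interp,
                               create_graph=True)[0]
    penalty = K * grad.norm(2, dim=1).pow(P).mean()

    loss = fake_score - real_score + penalty
    opt_c.zero_grad()
    loss.backward()
    opt_c.step()
    return loss.item()

def generator_step(batch_size):
    """One generator update: raise the critic's score on generated features."""
    noise = torch.randn(batch_size, NOISE_DIM)
    loss = -critic(generator(noise)).mean()
    opt_g.zero_grad()
    loss.backward()
    opt_g.step()
    return loss.item()

# Example: one training iteration on a random batch, standing in for real
# EMO-DB feature vectors of a single emotion class.
real_batch = torch.randn(32, FEAT_DIM)
critic_step(real_batch)
generator_step(32)

Per the abstract, one such generator would be trained for each of the five emotion classes, and its generated feature vectors appended to that class's training set before fitting the SVM or DNN classifier.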


2020 ◽  
Author(s):  
Siddique Latif ◽  
Muhammad Asim ◽  
Rajib Rana ◽  
Sara Khalifa ◽  
Raja Jurdak ◽  
...  

2020 ◽  
Vol 140 ◽  
pp. 358-365
Author(s):  
Zijiang Zhu ◽  
Weihuang Dai ◽  
Yi Hu ◽  
Junshan Li
