Deep Learning Approaches for Speech Emotion Recognition

Algorithms for Intelligent Systems - Deep Learning-Based Approaches for Sentiment Analysis ◽

10.1007/978-981-15-1216-2_10 ◽

2020 ◽

pp. 259-289

Author(s):

Anjali Bhavan ◽

Mohit Sharma ◽

Mehak Piplani ◽

Pankaj Chauhan ◽

Hitkul ◽

...

Keyword(s):

Deep Learning ◽

Emotion Recognition ◽

Speech Emotion Recognition ◽

Learning Approaches

Download Full-text

Deep learning approaches for speech emotion recognition: state of the art and research challenges

Multimedia Tools and Applications ◽

10.1007/s11042-020-09874-7 ◽

2021 ◽

Author(s):

Rashid Jahangir ◽

Ying Wah Teh ◽

Faiqa Hanif ◽

Ghulam Mujtaba

Keyword(s):

Deep Learning ◽

Emotion Recognition ◽

State Of The Art ◽

Speech Emotion Recognition ◽

Learning Approaches ◽

Research Challenges

Download Full-text

Correction to: Deep learning approaches for speech emotion recognition: state of the art and research challenges

Multimedia Tools and Applications ◽

10.1007/s11042-021-10967-0 ◽

2021 ◽

Author(s):

Rashid Jahangir ◽

Ying Wah Teh ◽

Faiqa Hanif ◽

Ghulam Mujtaba

Keyword(s):

Deep Learning ◽

Emotion Recognition ◽

State Of The Art ◽

Speech Emotion Recognition ◽

Learning Approaches ◽

Research Challenges

Download Full-text

Effective speech emotion recognition using deep learning approaches for Algerian dialect

2021 International Conference of Women in Data Science at Taif University (WiDSTaif ) ◽

10.1109/widstaif52235.2021.9430224 ◽

2021 ◽

Author(s):

Raoudha Yahia Cherif ◽

Abdelouahab Moussaoui ◽

Nabila Frahta ◽

Mohamed Berrimi

Keyword(s):

Deep Learning ◽

Emotion Recognition ◽

Speech Emotion Recognition ◽

Learning Approaches

Download Full-text

A Method upon Deep Learning for Speech Emotion Recognition

Journal of Advanced Engineering and Computation ◽

10.25073/jaec.202044.311 ◽

2021 ◽

Vol 4 (4) ◽

pp. 273

Author(s):

Nhat Truong Pham ◽

Ngoc Minh Duc Dang ◽

Sy Dung Nguyen

Keyword(s):

Deep Learning ◽

Emotion Recognition ◽

Speech Emotion Recognition ◽

Learning Approaches ◽

Global Features ◽

Time Step ◽

Creative Commons ◽

Computational Resources ◽

Deep Learning Model

Feature extraction and emotional classification are significant roles in speech emotion recognition. It is hard to extract and select the optimal features, researchers can not be sure what the features should be. With deep learning approaches, features could be extracted by using hierarchical abstraction layers, but it requires high computational resources and a large number of data. In this article, we choose static, differential, and acceleration coefficients of log Mel-spectrogram as inputs for the deep learning model. To avoid performance degradation, we also add a skip connection with dilated convolution network integration. All representatives are fed into a self-attention mechanism with bidirectional recurrent neural networks to learn long term global features and exploit context for each time step. Finally, we investigate contrastive center loss with softmax loss as loss function to improve the accuracy of emotion recognition. For validating robustness and effectiveness, we tested the proposed method on the Emo-DB and ERC2019 datasets. Experimental results show that the performance of the proposed method is strongly comparable with the existing state-of-the-art methods on the Emo-DB and ERC2019 with 88% and 67%, respectively. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium provided the original work is properly cited.

Download Full-text

Deep Learning Techniques for Speech Emotion Recognition, from Databases to Models

Sensors ◽

10.3390/s21041249 ◽

2021 ◽

Vol 21 (4) ◽

pp. 1249

Author(s):

Babak Joze Abbaschian ◽

Daniel Sierra-Sosa ◽

Adel Elmaghraby

Keyword(s):

Deep Learning ◽

Emotion Recognition ◽

Machine Learning Techniques ◽

Speech Emotion Recognition ◽

Learning Approaches ◽

Human Computer Interactions ◽

Learning Techniques ◽

Conventional Machine ◽

Feasible Solutions ◽

Network Approaches

The advancements in neural networks and the on-demand need for accurate and near real-time Speech Emotion Recognition (SER) in human–computer interactions make it mandatory to compare available methods and databases in SER to achieve feasible solutions and a firmer understanding of this open-ended problem. The current study reviews deep learning approaches for SER with available datasets, followed by conventional machine learning techniques for speech emotion recognition. Ultimately, we present a multi-aspect comparison between practical neural network approaches in speech emotion recognition. The goal of this study is to provide a survey of the field of discrete speech emotion recognition.

Download Full-text