Performance of deer hunting optimization based deep learning algorithm for speech emotion recognition

Author(s):  
Gaurav Agarwal ◽  
Hari Om
2020 ◽  
Vol 24 (5) ◽  
pp. 1065-1086
Author(s):  
Kudakwashe Zvarevashe ◽  
Oludayo O. Olugbara

Speech emotion recognition has become the heart of most human computer interaction applications in the modern world. The growing need to develop emotionally intelligent devices has opened up a lot of research opportunities. Most researchers in this field have applied the use of handcrafted features and machine learning techniques in recognising speech emotion. However, these techniques require extra processing steps and handcrafted features are usually not robust. They are computationally intensive because the curse of dimensionality results in low discriminating power. Research has shown that deep learning algorithms are effective for extracting robust and salient features in dataset. In this study, we have developed a custom 2D-convolution neural network that performs both feature extraction and classification of vocal utterances. The neural network has been evaluated against deep multilayer perceptron neural network and deep radial basis function neural network using the Berlin database of emotional speech, Ryerson audio-visual emotional speech database and Surrey audio-visual expressed emotion corpus. The described deep learning algorithm achieves the highest precision, recall and F1-scores when compared to other existing algorithms. It is observed that there may be need to develop customized solutions for different language settings depending on the area of applications.


Author(s):  
R Raja Subramanian ◽  
Chunduri Sandya Niharika ◽  
Dondapati Usha Rani ◽  
Parvathareddy Pavani ◽  
Ketepalli Poojita Lakshmi Syamala

Sign in / Sign up

Export Citation Format

Share Document