No Sample Left Behind: Towards a Comprehensive Evaluation of Speech Emotion Recognition Systems

Author(s):  
Pablo Riera ◽  
Luciana Ferrer ◽  
Agustín Gravano ◽  
Lara Gauder
Author(s):  
Shreya Kumar ◽  
Swarnalaxmi Thiruvenkadam

Feature extraction is an integral part in speech emotion recognition. Some emotions become indistinguishable from others due to high resemblance in their features, which results in low prediction accuracy. This paper analyses the impact of spectral contrast feature in increasing the accuracy for such emotions. The RAVDESS dataset has been chosen for this study. The SAVEE dataset, CREMA-D dataset and JL corpus dataset were also used to test its performance over different English accents. In addition to that, EmoDB dataset has been used to study its performance in the German language. The use of spectral contrast feature has increased the prediction accuracy in speech emotion recognition systems to a good degree as it performs well in distinguishing emotions with significant differences in arousal levels, and it has been discussed in detail.<div> </div>


IEEE Access ◽  
2021 ◽  
Vol 9 ◽  
pp. 47795-47814
Author(s):  
Taiba Majid Wani ◽  
Teddy Surya Gunawan ◽  
Syed Asif Ahmad Qadri ◽  
Mira Kartiwi ◽  
Eliathamby Ambikairajah

2013 ◽  
Vol 38 (4) ◽  
pp. 465-470 ◽  
Author(s):  
Jingjie Yan ◽  
Xiaolan Wang ◽  
Weiyi Gu ◽  
LiLi Ma

Abstract Speech emotion recognition is deemed to be a meaningful and intractable issue among a number of do- mains comprising sentiment analysis, computer science, pedagogy, and so on. In this study, we investigate speech emotion recognition based on sparse partial least squares regression (SPLSR) approach in depth. We make use of the sparse partial least squares regression method to implement the feature selection and dimensionality reduction on the whole acquired speech emotion features. By the means of exploiting the SPLSR method, the component parts of those redundant and meaningless speech emotion features are lessened to zero while those serviceable and informative speech emotion features are maintained and selected to the following classification step. A number of tests on Berlin database reveal that the recogni- tion rate of the SPLSR method can reach up to 79.23% and is superior to other compared dimensionality reduction methods.


Author(s):  
Promod Yenigalla ◽  
Abhay Kumar ◽  
Suraj Tripathi ◽  
Chirag Singh ◽  
Sibsambhu Kar ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document