Domain Adaptation Techniques for EEG-Based Emotion Recognition: A Comparative Study on Two Public Datasets

Speech emotion recognition (SER) is a natural method of recognizing individual emotions in everyday life. To distribute SER models to real-world applications, some key challenges must be overcome, such as the lack of datasets tagged with emotion labels and the weak generalization of the SER model for an unseen target domain. This study proposes a multi-path and group-loss-based network (MPGLN) for SER to support multi-domain adaptation. The proposed model includes a bidirectional long short-term memory-based temporal feature generator and a transferred feature extractor from the pre-trained VGG-like audio classification model (VGGish), and it learns simultaneously based on multiple losses according to the association of emotion labels in the discrete and dimensional models. For the evaluation of the MPGLN SER as applied to multi-cultural domain datasets, the Korean Emotional Speech Database (KESD), including KESDy18 and KESDy19, is constructed, and the English-speaking Interactive Emotional Dyadic Motion Capture database (IEMOCAP) is used. The evaluation of multi-domain adaptation and domain generalization showed 3.7% and 3.5% improvements, respectively, of the F1 score when comparing the performance of MPGLN SER with a baseline SER model that uses a temporal feature generator. We show that the MPGLN SER efficiently supports multi-domain adaptation and reinforces model generalization.

Download Full-text

Cross-corpus Speech Emotion Recognition based on Few-shot Learning and Domain Adaptation

IEEE Signal Processing Letters ◽

10.1109/lsp.2021.3086395 ◽

2021 ◽

pp. 1-1

Author(s):

Youngdo Ahn ◽

Sung Joo Lee ◽

Jong Won Shin

Keyword(s):

Emotion Recognition ◽

Domain Adaptation ◽

Speech Emotion Recognition

Download Full-text

A Comparative Study on Detection Accuracy of Cloud-Based Emotion Recognition Services

Proceedings of the 2018 International Conference on Signal Processing and Machine Learning - SPML '18 ◽

10.1145/3297067.3297079 ◽

2018 ◽

Cited By ~ 4

Author(s):

Osamah M. Al-Omair ◽

Shihong Huang

Keyword(s):

Comparative Study ◽

Emotion Recognition ◽

Detection Accuracy

Download Full-text

Universum Autoencoder-Based Domain Adaptation for Speech Emotion Recognition

IEEE Signal Processing Letters ◽

10.1109/lsp.2017.2672753 ◽

2017 ◽

Vol 24 (4) ◽

pp. 500-504 ◽

Cited By ~ 37

Author(s):

Jun Deng ◽

Xinzhou Xu ◽

Zixing Zhang ◽

Sascha Fruhholz ◽

Bjorn Schuller

Keyword(s):

Emotion Recognition ◽

Domain Adaptation ◽

Speech Emotion Recognition

Download Full-text

Acted vs. Improvised: Domain Adaptation for Elicitation Approaches in Audio-Visual Emotion Recognition

10.21437/interspeech.2021-666 ◽

2021 ◽

Author(s):

Haoqi Li ◽

Yelin Kim ◽

Cheng-Hao Kuo ◽

Shrikanth S. Narayanan

Keyword(s):

Emotion Recognition ◽

Domain Adaptation

Download Full-text

A Comparative Study of Different Weighting Schemes on KNN-Based Emotion Recognition in Mandarin Speech

Lecture Notes in Computer Science - Advanced Intelligent Computing Theories and Applications. With Aspects of Theoretical and Methodological Issues ◽

10.1007/978-3-540-74171-8_101 ◽

2007 ◽

pp. 997-1005 ◽

Cited By ~ 9

Author(s):

Tsang-Long Pao ◽

Yu-Te Chen ◽

Jun-Heng Yeh ◽

Yun-Maw Cheng ◽

Yu-Yuan Lin

Keyword(s):

Comparative Study ◽

Emotion Recognition ◽

Weighting Schemes

Download Full-text

Comparative Study and Analysis of Various Facial Emotion Recognition Techniques

Asset Analytics - Decision Analytics Applications in Industry ◽

10.1007/978-981-15-3643-4_11 ◽

2020 ◽

pp. 157-172

Author(s):

Naveen Kumari ◽

Rekha Bhatia

Keyword(s):

Comparative Study ◽

Emotion Recognition ◽

Facial Emotion Recognition ◽

Facial Emotion

Download Full-text

Emotion Assessment Using Feature Fusion and Decision Fusion Classification Based on Physiological Data: Are We There Yet?

Sensors ◽

10.3390/s20174723 ◽

2020 ◽

Vol 20 (17) ◽

pp. 4723

Author(s):

Patrícia Bota ◽

Chen Wang ◽

Ana Fred ◽

Hugo Silva

Keyword(s):

Emotion Recognition ◽

Classification Accuracy ◽

Feature Fusion ◽

State Of The Art ◽

Electrodermal Activity ◽

Decision Fusion ◽

Classification Performance ◽

Physiological Data ◽

Systematic Analysis ◽

Public Datasets

Emotion recognition based on physiological data classification has been a topic of increasingly growing interest for more than a decade. However, there is a lack of systematic analysis in literature regarding the selection of classifiers to use, sensor modalities, features and range of expected accuracy, just to name a few limitations. In this work, we evaluate emotion in terms of low/high arousal and valence classification through Supervised Learning (SL), Decision Fusion (DF) and Feature Fusion (FF) techniques using multimodal physiological data, namely, Electrocardiography (ECG), Electrodermal Activity (EDA), Respiration (RESP), or Blood Volume Pulse (BVP). The main contribution of our work is a systematic study across five public datasets commonly used in the Emotion Recognition (ER) state-of-the-art, namely: (1) Classification performance analysis of ER benchmarking datasets in the arousal/valence space; (2) Summarising the ranges of the classification accuracy reported across the existing literature; (3) Characterising the results for diverse classifiers, sensor modalities and feature set combinations for ER using accuracy and F1-score; (4) Exploration of an extended feature set for each modality; (5) Systematic analysis of multimodal classification in DF and FF approaches. The experimental results showed that FF is the most competitive technique in terms of classification accuracy and computational complexity. We obtain superior or comparable results to those reported in the state-of-the-art for the selected datasets.

Download Full-text