The Role of Task and Acoustic Similarity in Audio Transfer Learning: Insights from the Speech Emotion Recognition Case

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414896 ◽

2021 ◽

Author(s):

Andreas Triantafyllopoulos ◽

Bjorn W. Schuller

Keyword(s):

Emotion Recognition ◽

Transfer Learning ◽

Speech Emotion Recognition ◽

Acoustic Similarity

Download Full-text

Speech Emotion Recognition among Couples using the Peak-End Rule and Transfer Learning

Companion Publication of the 2020 International Conference on Multimodal Interaction ◽

10.1145/3395035.3425253 ◽

2020 ◽

Author(s):

George Boateng ◽

Laura Sels ◽

Peter Kuppens ◽

Peter Hilpert ◽

Tobias Kowatsch

Keyword(s):

Emotion Recognition ◽

Transfer Learning ◽

Speech Emotion Recognition

Download Full-text

CNN-based Speech Emotion Recognition Model Applying Transfer Learning and Attention Mechanism

Journal of KIISE ◽

10.5626/jok.2020.47.7.665 ◽

2020 ◽

Vol 47 (7) ◽

pp. 665-673

Author(s):

Jung Hyun Lee ◽

Ui Nyoung Yoon ◽

Geun-Sik Jo

Keyword(s):

Emotion Recognition ◽

Transfer Learning ◽

Attention Mechanism ◽

Speech Emotion Recognition ◽

Recognition Model

Download Full-text

Transfer Learning for Speech Emotion Recognition

2019 IEEE 5th Intl Conference on Big Data Security on Cloud (BigDataSecurity), IEEE Intl Conference on High Performance and Smart Computing, (HPSC) and IEEE Intl Conference on Intelligent Data and Security (IDS) ◽

10.1109/bigdatasecurity-hpsc-ids.2019.00027 ◽

2019 ◽

Author(s):

Zhijie Han ◽

Huijuan Zhao ◽

Ruchuan Wang

Keyword(s):

Emotion Recognition ◽

Transfer Learning ◽

Speech Emotion Recognition

Download Full-text

Cross-Corpus Speech Emotion Recognition Based on Sparse Subspace Transfer Learning

10.1007/978-3-030-86608-2_51 ◽

2021 ◽

pp. 466-473

Author(s):

Keke Zhao ◽

Peng Song ◽

Wenjing Zhang ◽

Weijian Zhang ◽

Shaokai Li ◽

...

Keyword(s):

Emotion Recognition ◽

Transfer Learning ◽

Speech Emotion Recognition

Download Full-text

Speech Emotion Recognition Using Transfer Learning

IEICE Transactions on Information and Systems ◽

10.1587/transinf.2014edl8038 ◽

2014 ◽

Vol E97.D (9) ◽

pp. 2530-2532 ◽

Author(s):

Peng SONG ◽

Yun JIN ◽

Li ZHAO ◽

Minghai XIN

Keyword(s):

Emotion Recognition ◽

Transfer Learning ◽

Speech Emotion Recognition

Download Full-text

A web crowdsourcing framework for transfer learning and personalized Speech Emotion Recognition

Machine Learning with Applications ◽

10.1016/j.mlwa.2021.100132 ◽

2021 ◽

pp. 100132

Author(s):

Nikolaos Vryzas ◽

Lazaros Vrysis ◽

Rigas Kotsakis ◽

Charalampos Dimoulas

Keyword(s):

Emotion Recognition ◽

Transfer Learning ◽

Speech Emotion Recognition

Download Full-text

Speech Emotion Recognition among Elderly Individuals using Multimodal Fusion and Transfer Learning

Companion Publication of the 2020 International Conference on Multimodal Interaction ◽

10.1145/3395035.3425255 ◽

2020 ◽

Author(s):

George Boateng ◽

Tobias Kowatsch

Keyword(s):

Emotion Recognition ◽

Transfer Learning ◽

Multimodal Fusion ◽

Speech Emotion Recognition ◽

Elderly Individuals

Download Full-text

EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition

IEEE Transactions on Affective Computing ◽

10.1109/taffc.2021.3135152 ◽

2021 ◽

pp. 1-1

Author(s):

Maurice Gerczuk ◽

Shahin Amiriparian ◽

Sandra Ottl ◽

Bjorn W. Schuller

Keyword(s):

Emotion Recognition ◽

Transfer Learning ◽

Speech Emotion Recognition ◽

Learning Framework

Download Full-text

Recognizing More Emotions with Less Data Using Self-supervised Transfer Learning

10.20944/preprints202008.0645.v1 ◽

2020 ◽

Author(s):

Jonathan Boigne ◽

Biman Liyanage ◽

Ted Östrem

Keyword(s):

Neural Network ◽

Emotion Recognition ◽

Transfer Learning ◽

Network Performance ◽

State Of The Art ◽

Research Community ◽

Training Data ◽

Linguistic Knowledge ◽

Speech Emotion Recognition ◽

Learning Method

We propose a novel transfer learning method for speech emotion recognition allowing us to obtain promising results when only few training data is available. With as low as 125 examples per emotion class, we were able to reach a higher accuracy than a strong baseline trained on 8 times more data. Our method leverages knowledge contained in pre-trained speech representations extracted from models trained on a more general self-supervised task which doesn’t require human annotations, such as the wav2vec model. We provide detailed insights on the benefits of our approach by varying the training data size, which can help labeling teams to work more efficiently. We compare performance with other popular methods on the IEMOCAP dataset, a well-benchmarked dataset among the Speech Emotion Recognition (SER) research community. Furthermore, we demonstrate that results can be greatly improved by combining acoustic and linguistic knowledge from transfer learning. We align acoustic pre-trained representations with semantic representations from the BERT model through an attention-based recurrent neural network. Performance improves significantly when combining both modalities and scales with the amount of data. When trained on the full IEMOCAP dataset, we reach a new state-of-the-art of 73.9% unweighted accuracy (UA).

Download Full-text

Role of gender influence in vocal Hindi conversations: A study on speech emotion recognition

2016 International Conference on Computing Communication Control and automation (ICCUBEA) ◽

10.1109/iccubea.2016.7860021 ◽

2016 ◽

Author(s):

Devika Verma ◽

Debajyoti Mukhopadhyay ◽

Emmanuel Mark

Keyword(s):

Emotion Recognition ◽

Speech Emotion Recognition ◽

Gender Influence

Download Full-text