On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks

Author(s):  
Egor Lakomkin ◽  
Mohammad Ali Zamani ◽  
Cornelius Weber ◽  
Sven Magg ◽  
Stefan Wermter

Author(s):  
Syed Asif Ahmad Qadri ◽  
Teddy Surya Gunawan ◽  
Taiba Majid Wani ◽  
Eliathamby Ambikairajah ◽  
Mira Kartiwi ◽  
...  

Author(s):  
Biqiao Zhang ◽  
Yuqing Kong ◽  
Georg Essl ◽  
Emily Mower Provost

In this paper, we propose a Deep Metric Learning (DML) approach that supports soft labels. DML seeks to learn representations that encode the similarity between examples through deep neural networks. DML generally presupposes that data can be divided into discrete classes using hard labels. However, some tasks, such as our exemplary domain of speech emotion recognition (SER), work with inherently subjective data, data for which it may not be possible to identify a single hard label. We propose a family of loss functions, f-Similarity Preservation Loss (f-SPL), based on the dual form of f-divergence for DML with soft labels. We show that the minimizer of f-SPL preserves the pairwise label similarities in the learned feature embeddings. We demonstrate the efficacy of the proposed loss function on the task of cross-corpus SER with soft labels. Our approach, which combines f-SPL and classification loss, significantly outperforms a baseline SER system with the same structure but trained with only classification loss in most experiments. We show that the presented techniques are more robust to over-training and can learn an embedding space in which the similarity between examples is meaningful.
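The abstract defines f-SPL via the dual form of f-divergence; that exact formulation is not reproduced here. The core idea, though, of making pairwise similarity in the embedding space match pairwise similarity between soft labels, can be sketched with a simple squared-error surrogate. The function names and the squared-error term below are illustrative, not the paper's actual loss:

```python
import numpy as np

def pairwise_cosine(x):
    # Cosine similarity between every pair of rows of x.
    n = x / np.linalg.norm(x, axis=1, keepdims=True)
    return n @ n.T

def similarity_preservation_loss(embeddings, soft_labels):
    """Penalize mismatch between embedding-space similarity and
    soft-label similarity, computed over all example pairs.

    embeddings:  (N, d) learned feature vectors
    soft_labels: (N, k) per-example emotion label distributions
    """
    emb_sim = pairwise_cosine(embeddings)
    lab_sim = pairwise_cosine(soft_labels)
    # Squared error stands in for the paper's f-divergence-based term.
    return float(np.mean((emb_sim - lab_sim) ** 2))
```

A minimizer of this surrogate drives the embedding similarity matrix toward the label similarity matrix, which is the property the abstract claims for the true f-SPL minimizer; swapping the squared error for a term derived from an f-divergence dual would recover the loss family's structure.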


2020 ◽  
Author(s):  
Ronnypetson Da Silva ◽  
Valter M. Filho ◽  
Mario Souza

Many works that apply Deep Neural Networks (DNNs) to Speech Emotion Recognition (SER) use single datasets, or train and evaluate models separately when multiple datasets are available. Each dataset is constructed under its own guidelines, and the subjective nature of SER labels makes it difficult to obtain robust and general models. We investigate how DNNs learn shared representations for different datasets in both multi-task and unified setups. We also analyse how each dataset benefits from the others across different combinations of datasets and popular neural network architectures. We show that the longstanding belief that more data yields more general models does not always hold for SER, as a different combination of datasets and meta-parameters produces the best result for each of the analysed datasets.
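The multi-task setup described above amounts to a shared encoder with one classification head per corpus, so each dataset keeps its own label set while the trunk is trained on all of them (a unified setup would instead use a single head over a merged label set). A minimal numpy sketch, with made-up corpus names and class counts rather than the paper's actual datasets:

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

class SharedEncoderModel:
    """Shared trunk plus one softmax head per emotion corpus
    (multi-task setup). Corpus names and class counts are
    illustrative placeholders."""

    def __init__(self, n_features, n_hidden, head_sizes, seed=0):
        rng = np.random.default_rng(seed)
        # Trunk weights are shared by every corpus.
        self.W_shared = rng.normal(size=(n_features, n_hidden)) * 0.1
        # One output head per corpus, each with its own class count.
        self.heads = {name: rng.normal(size=(n_hidden, n_cls)) * 0.1
                      for name, n_cls in head_sizes.items()}

    def forward(self, x, corpus):
        h = relu(x @ self.W_shared)       # shared representation
        logits = h @ self.heads[corpus]   # corpus-specific head
        e = np.exp(logits - logits.max(axis=1, keepdims=True))
        return e / e.sum(axis=1, keepdims=True)
```

Because only the trunk is shared, gradients from every corpus shape the common representation, which is where the cross-dataset benefit (or interference) the abstract analyses would show up.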


2020 ◽  
Vol 509 ◽  
pp. 150-163 ◽  
Author(s):  
Luefeng Chen ◽  
Wanjuan Su ◽  
Yu Feng ◽  
Min Wu ◽  
Jinhua She ◽  
...  
