Multimodal Continuous Emotion Recognition with Data Augmentation Using Recurrent Neural Networks

Author(s):  
Jian Huang ◽  
Ya Li ◽  
Jianhua Tao ◽  
Zheng Lian ◽  
Mingyue Niu ◽  
...  
2020 ◽  
Author(s):  
Dean Sumner ◽  
Jiazhen He ◽  
Amol Thakkar ◽  
Ola Engkvist ◽  
Esben Jannik Bjerrum

SMILES randomization, a form of data augmentation, has previously been shown to increase the performance of deep learning models over non-augmented baselines. Here, we propose a novel data augmentation method we call “Levenshtein augmentation”, which considers local SMILES sub-sequence similarity between reactants and their respective products when creating training pairs. The performance of Levenshtein augmentation was tested using two state-of-the-art models: a transformer and a sequence-to-sequence recurrent neural network with attention. Levenshtein augmentation demonstrated improved performance over both non-augmented data and conventional SMILES-randomization-augmented data when used to train the baseline models. Furthermore, Levenshtein augmentation appears to produce what we define as “attentional gain”, an enhancement in the underlying network’s ability to recognize molecular motifs.
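The abstract does not spell out the augmentation procedure, so the following is only a minimal Python sketch of one plausible reading: randomized reactant SMILES are generated with RDKit and ranked by Levenshtein (edit) distance to the product string, keeping the closest variants as training pairs. The helper names (`levenshtein`, `augment_pair`), the selection heuristic, and the toy SMILES are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch of Levenshtein-guided SMILES augmentation.
# Assumes RDKit is available; the pair-selection heuristic below is an
# illustrative guess, not the published procedure.
from rdkit import Chem


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                 # deletion
                            curr[j - 1] + 1,             # insertion
                            prev[j - 1] + (ca != cb)))   # substitution
        prev = curr
    return prev[-1]


def augment_pair(reactant_smiles: str, product_smiles: str,
                 n_samples: int = 20, n_keep: int = 5):
    """Generate randomized reactant SMILES and keep the variants whose
    edit distance to the product string is smallest (assumption: local
    sub-sequence similarity is approximated by Levenshtein distance)."""
    mol = Chem.MolFromSmiles(reactant_smiles)
    variants = {Chem.MolToSmiles(mol, doRandom=True) for _ in range(n_samples)}
    scored = sorted(variants, key=lambda s: levenshtein(s, product_smiles))
    return [(s, product_smiles) for s in scored[:n_keep]]


# Example: esterification-like toy reaction (illustrative SMILES only).
pairs = augment_pair("CC(=O)O.OCC", "CC(=O)OCC")
for reactant, product in pairs:
    print(reactant, ">>", product)
```

The point of the sketch is only to show how an edit-distance criterion can steer which randomized SMILES pairs enter the training set; the actual scoring in the paper may operate on sub-sequences rather than whole strings.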


2020 ◽  
Vol 17 (8) ◽  
pp. 3786-3789
Author(s):  
P. Gayathri ◽  
P. Gowri Priya ◽  
L. Sravani ◽  
Sandra Johnson ◽  
Visanth Sampath

Emotion recognition is an aspect of speech recognition that is gaining increasing attention, and the need for it is growing rapidly. Although machine learning techniques exist for identifying emotion, we assume in this paper that computing deltas and delta-deltas for customized features not only preserves effective emotional information but also reduces the impact of irrelevant emotional factors, leading to fewer misclassifications. Furthermore, Speech Emotion Recognition (SER) often suffers from silent frames and emotionally irrelevant frames. Meanwhile, attention mechanisms have demonstrated exceptional performance in learning task-specific feature representations. Inspired by this, we propose an Attention-based Convolutional Recurrent Neural Network (ACRNN) to learn discriminative features for SER, where the Mel-spectrogram with deltas and delta-deltas is used as input. Finally, experimental results show the feasibility of the proposed method, which attains state-of-the-art performance in terms of unweighted average recall.
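As a concrete illustration of the input representation described above (a log Mel-spectrogram stacked with its deltas and delta-deltas), here is a minimal Python sketch using librosa. The file name, sample rate, and filter-bank/frame settings are illustrative assumptions rather than the paper's exact configuration.

```python
# Minimal sketch of a 3-channel input: log Mel-spectrogram plus its
# deltas and delta-deltas. Settings below are illustrative assumptions.
import numpy as np
import librosa

# Load an utterance (replace with a real path; 16 kHz is a common SER choice).
y, sr = librosa.load("utterance.wav", sr=16000)

# Static channel: log-scaled Mel-spectrogram.
mel = librosa.feature.melspectrogram(y=y, sr=sr, n_fft=400, hop_length=160,
                                     n_mels=40)
log_mel = librosa.power_to_db(mel)

# Dynamic channels: first- and second-order differences along time.
delta = librosa.feature.delta(log_mel, order=1)
delta2 = librosa.feature.delta(log_mel, order=2)

# Stack into a (3, n_mels, n_frames) tensor for a CNN/CRNN front end.
features = np.stack([log_mel, delta, delta2], axis=0)
print(features.shape)
```

The stacked tensor would then be fed to the convolutional front end of an attention-based CRNN; the attention layer's role, per the abstract, is to down-weight silent and emotionally irrelevant frames when pooling over time.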

