The Investigation of Different Loss Functions with Capsule Networks for Speech Emotion Recognition
Keyword(s):
Speech emotion recognition (SER) is an important research topic. Image features like spectrograms are one of the common ways of extracting information from speech. In the area of image recognition, a relatively novel type of network called capsule networks has shown good and promising results. This study aims to use capsule networks to encode spatial information from spectrograms and analyse its performance when paired with different loss functions. Experiments comparing the capsule network with models from previous works show that the capsule network performs better than them.
2014 ◽
Vol 602-605
◽
pp. 3570-3574
2014 ◽
Vol 46
(1)
◽
pp. 145-161
2011 ◽
pp. 179-212
◽
2019 ◽
Vol 10
(4)
◽
pp. 1-25