Deep Learning-based Categorical and Dimensional Emotion Recognition for Written and Spoken Text
The demand for recognizing emotion in text has grown increasingly as human emotion can be expressed via text and manytechnologies, such as product reviews and speech transcription, can benefit from text emotion recognition. The study of text emotionrecognition was established some decades ago using unsupervised learning and a small amount of data. Advancements incomputation hardware and in the development of larger text corpus have enabled us to analyze emotion in the text by moresophisticated techniques. This paper presents a deep learning-based approach for the recognition of categorical and dimensionalemotion from both written and spoken texts. The result shows that the system performs better on both categorical and dimensional task( > 60% accuracy and < 20% error) with a larger dataset compared to a smaller dataset. We also found the recognition rate is affectedby both the size of the data and the number of emotion categories. On the dimensional task, a larger amount of data consistentlyprovided a better result. Recognition of categorical emotion on a spoken text is easier than on a written text, while on dimensional task,the written text yielded better performance.