Emotional Speech Recognition using Deep Learning

2020 ◽ Vol 14 (4) ◽ pp. 39-55
Author(s): Othman O. Khalifa, M. I. Alhamada, Aisha H. Abdalla
Author(s): Máximo E. Sánchez-Gutiérrez, E. Marcelo Albornoz, Fabiola Martinez-Licona, H. Leonardo Rufiner, John Goddard

2021 ◽ Vol 29 (3)
Author(s): Bennilo Fernandes, Kasiprasad Mannepalli

Designing the interaction between human language and a registered emotional database lets us explore how the system performs, and multiple approaches exist for emotion detection in patient services. Until now, clustering techniques have been widely used in many prominent areas, including emotional speech recognition, and show good results; this paper instead presents a new design based on Long Short-Term Memory (LSTM), Bi-Directional LSTM (BiLSTM) and Gated Recurrent Unit (GRU) networks as estimation methods for an emotional Tamil dataset. A deep hierarchical LSTM/BiLSTM/GRU layer architecture is designed to obtain the best results for long-term learning on the voice dataset. Five combinations of deep hierarchical layers are designed: LSTM & GRU (DHLG), BiLSTM & GRU (DHBG), GRU & LSTM (DHGL), GRU & BiLSTM (DHGB) and dual GRU (DHGG), each with a dropout layer introduced to overcome the learning problem and vanishing-gradient issues in emotional speech recognition. Moreover, to improve the outcome on each emotional speech signal, various feature extraction combinations are utilized. In the analysis, the proposed DHGB model gives an average classification accuracy of 82.86%, slightly higher than the other models: DHGL (82.58%), DHBG (82%), DHLG (81.14%) and DHGG (80%). Thus, comparing all the models, DHGB gives a prominent outcome of up to 5% over the other four, with minimal training time and a small dataset.
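The hierarchical designs named in the abstract stack one recurrent layer on another with dropout in between. As a minimal illustrative sketch of the DHGB variant (a GRU layer feeding a BiLSTM layer), here is a hypothetical PyTorch module; the feature dimension, hidden size, number of emotion classes and dropout rate are assumptions for illustration, not the configuration published by the authors.

```python
import torch
import torch.nn as nn

class DHGB(nn.Module):
    """Sketch of a GRU -> BiLSTM hierarchy (the DHGB combination).
    All layer sizes below are illustrative assumptions."""

    def __init__(self, n_features=39, hidden=64, n_emotions=5, p_drop=0.3):
        super().__init__()
        self.gru = nn.GRU(n_features, hidden, batch_first=True)
        self.drop = nn.Dropout(p_drop)  # dropout between the recurrent layers
        self.bilstm = nn.LSTM(hidden, hidden, batch_first=True,
                              bidirectional=True)
        self.fc = nn.Linear(2 * hidden, n_emotions)  # 2x for bidirectional

    def forward(self, x):
        # x: (batch, time, features), e.g. per-frame acoustic features
        out, _ = self.gru(x)
        out, _ = self.bilstm(self.drop(out))
        # classify the utterance from the final time step's representation
        return self.fc(out[:, -1, :])

model = DHGB()
logits = model(torch.randn(2, 100, 39))  # 2 utterances, 100 frames each
print(logits.shape)  # torch.Size([2, 5])
```

Swapping the two recurrent layers (or their types) yields the other combinations the paper compares, such as DHGL or DHGG.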


2020
Author(s): M. I. Alhamada, O. O. Khalifa, A. H. Abdalla
