Convolutional Recurrent Neural Networks Based Speech Emotion Recognition

2020 ◽  
Vol 17 (8) ◽  
pp. 3786-3789
Author(s):  
P. Gayathri ◽  
P. Gowri Priya ◽  
L. Sravani ◽  
Sandra Johnson ◽  
Visanth Sampath

Recognition of emotions is the aspect of speech recognition that is gaining more attention and the need for it is growing enormously. Although there are methods to identify emotion using machine learning techniques, we assume in this paper that calculating deltas and delta-deltas for customized features not only preserves effective emotional information, but also that the impact of irrelevant emotional factors, leading to a reduction in misclassification. Furthermore, Speech Emotion Recognition (SER) often suffers from the silent frames and irrelevant emotional frames. Meanwhile, the process of attention has demonstrated exceptional performance in learning related feature representations for specific tasks. Inspired by this, propose a Convolutionary Recurrent Neural Networks (ACRNN) based on Attention to learn discriminative features for SER, where the Mel-spectrogram with deltas and delta-deltas is used as input. Finally, experimental results show the feasibility of the proposed method and attain state-of-the-art performance in terms of unweighted average recall.

Author(s):  
Todor D. Ganchev

In this chapter we review various computational models of locally recurrent neurons and deliberate the architecture of some archetypal locally recurrent neural networks (LRNNs) that are based on them. Generalizations of these structures are discussed as well. Furthermore, we point at a number of realworld applications of LRNNs that have been reported in past and recent publications. These applications involve classification or prediction of temporal sequences, discovering and modeling of spatial and temporal correlations, process identification and control, etc. Validation experiments reported in these developments provide evidence that locally recurrent architectures are capable of identifying and exploiting temporal and spatial correlations (i.e., the context in which events occur), which is the main reason for their advantageous performance when compared with the one of their non-recurrent counterparts or other reasonable machine learning techniques.


2018 ◽  
Vol 7 (2.32) ◽  
pp. 177 ◽  
Author(s):  
Dr M.R.Narasinga Rao ◽  
V Venkatesh Prasad ◽  
P Sai Teja ◽  
Md Zindavali ◽  
O Phanindra Reddy

Deep neural nets with a vast quantity of parameters are very effective machine getting to know structures. However, overfitting is an extreme problem in such networks. Massive networks are also sluggish to use, making it difficult to cope with overfitting by combining the predictions of many distinct large neural nets at test time. Dropout is a method for addressing this problem. The important thing concept is to randomly drop units (at the side of their connections) from the neural network for the duration of education. This prevents units from co-adapting an excessive amount of. during schooling, dropout samples from an exponential quantity of various "thinned" networks. At take a look at the time, it is simple to precise the impact of averaging the predictions of plenty of these thinned networks through in reality using a single unthinned network that has smaller weights. This considerably minimize overfitting and provides fundamental enhancements over other regularization techniques. We show that dropout enhance the overall performance of neural networks on manage gaining knowledge of obligations in imaginative and prescient, speech reputation, document type and computational biology, acquiring today's effects on many benchmark facts sets.  


2021 ◽  
pp. 1-105
Author(s):  
Diana Salazar Florez ◽  
Heather Bedle

Nowadays, there are many unsupervised and supervised machine learning techniques available for performing seismic facies classification. However, those classification methods either demand high computational costs or do not provide an accurate measure of confidence. Probabilistic neural networks (PNNs) overcome these limitations and have demonstrated their superiority among other algorithms. PNNs have been extensively applied for some prediction tasks, but not well studied regarding the prediction of seismic facies volumes using seismic attributes. We explore the capability of the PNN algorithm when classifying large- and small-scale seismic facies. Additionally, we evaluate the impact of user-chosen parameters on the final classification volumes. After performing seven tests, each with a parameter variation, we assess the impact of the parameter change on the resultant classification volumes. We show that the processing task can have a significant impact on the classification volumes, but also how the most geologically complex areas are the most challenging for the algorithm. Moreover, we demonstrate that even if the PNN technique is performing and producing considerably accurate results, it is possible to overcome those limitations and significantly improve the final classification volumes by including the geological insight provided by the geoscientist. We conclude by proposing a new workflow that can guide future geoscientists interested in applying PNNs, to obtain better seismic facies classification volumes by considering some initial steps and advice.


Sign in / Sign up

Export Citation Format

Share Document