Convolutional Recurrent Neural Networks for Text Classification

Recurrent neural networks (RNNs) and convolutional neural networks (CNNs) are two prevailing architectures used in text classification. Traditional approaches combine the strengths of the two by simply chaining one after the other or by concatenating the features extracted from each. In this article, a novel approach is proposed that preserves the strengths of both RNNs and CNNs to a great extent. In the proposed approach, a bi-directional RNN encodes each word into forward and backward hidden states. A neural tensor layer then fuses the bi-directional hidden states into word representations. Meanwhile, a convolutional neural network learns the importance of each word for text classification. Experiments are conducted on several text classification datasets, and the superior performance of the proposed approach confirms its effectiveness.
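The fusion step described above can be sketched as follows. This is a minimal NumPy illustration, not the authors' implementation: the bilinear-plus-linear form of the tensor layer, the softmax pooling of importance scores, and every name and dimension (`neural_tensor_fusion`, `weighted_text_vector`, `T`, `d`, `k`) are assumptions made for the example. Random arrays stand in for the RNN hidden states and for the CNN-produced importance scores.

```python
import numpy as np

def neural_tensor_fusion(h_fwd, h_bwd, V, W, b):
    """Fuse forward/backward hidden states into word representations.

    h_fwd, h_bwd: (T, d) hidden states for T words.
    V: (k, d, d) tensor, one bilinear slice per output unit.
    W: (k, 2d) linear weights on the concatenated states.
    b: (k,) bias.
    Returns a (T, k) array of word representations.
    """
    # Bilinear term: h_fwd[t]^T V[k] h_bwd[t] for every word t and slice k.
    bilinear = np.einsum("ti,kij,tj->tk", h_fwd, V, h_bwd)
    # Linear term on the concatenated forward and backward states.
    linear = np.concatenate([h_fwd, h_bwd], axis=1) @ W.T
    return np.tanh(bilinear + linear + b)

def weighted_text_vector(word_repr, importance_logits):
    """Softmax-normalise per-word scores and pool the words into one vector."""
    shifted = importance_logits - importance_logits.max()
    weights = np.exp(shifted) / np.exp(shifted).sum()
    return weights @ word_repr, weights

rng = np.random.default_rng(0)
T, d, k = 5, 4, 3  # words, hidden size, representation size (illustrative)
h_fwd = rng.normal(size=(T, d))   # stand-in for forward RNN states
h_bwd = rng.normal(size=(T, d))   # stand-in for backward RNN states
V = rng.normal(size=(k, d, d)) * 0.1
W = rng.normal(size=(k, 2 * d)) * 0.1
b = np.zeros(k)

word_repr = neural_tensor_fusion(h_fwd, h_bwd, V, W, b)
scores = rng.normal(size=T)       # stand-in for CNN importance scores
text_vec, weights = weighted_text_vector(word_repr, scores)
```

The resulting `text_vec` would then feed a classifier; how the importance scores are actually combined with the word representations is not stated in the abstract.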


EMOTIONS RECOGNITION IN HUMAN SPEECH USING DEEP NEURAL NETWORKS

Vestnik komp'iuternykh i informatsionnykh tekhnologii ◽

10.14489/vkit.2021.01.pp.044-051 ◽

2021 ◽

pp. 44-51

Author(s):

E. Yu. Shchetinin

Keyword(s):

Neural Network ◽

Machine Learning ◽

Neural Networks ◽

Convolutional Neural Network ◽

Recurrent Neural Network ◽

Deep Neural Networks ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Audio Recordings ◽

Computer Studies

The recognition of human emotions is one of the most relevant and rapidly developing areas of modern speech technology, and the recognition of emotions in speech (RER) is its most in-demand part. In this paper, we propose a computer model of emotion recognition based on an ensemble of a bidirectional recurrent neural network with LSTM memory cells and the deep convolutional neural network ResNet18. Computer studies are carried out on the RAVDESS database of emotional human speech. RAVDESS is a data set containing 7356 files. Entries contain the following emotions: 0 – neutral, 1 – calm, 2 – happiness, 3 – sadness, 4 – anger, 5 – fear, 6 – disgust, 7 – surprise. In total, the database contains 16 classes (8 emotions, each divided into male and female) and 1440 speech-only samples. To train machine learning algorithms and deep neural networks to recognize emotions, the audio recordings must be pre-processed to extract the main characteristic features of each emotion. This was done using Mel-frequency cepstral coefficients, chroma coefficients, and characteristics of the frequency spectrum of the recordings. Various neural network models for emotion recognition are studied on these data, and classical machine learning algorithms are used for comparative analysis. The following models were trained during the experiments: logistic regression (LR), a support vector machine (SVM) classifier, decision tree (DT), random forest (RF), gradient boosting over trees (XGBoost), a convolutional neural network (CNN, ResNet18), a recurrent neural network (RNN), and a stacked ensemble of convolutional and recurrent networks (Stacked CNN-RNN). The results show that the neural networks achieved much higher accuracy in recognizing and classifying emotions than the machine learning algorithms used. Of the three neural network models presented, the CNN + BLSTM ensemble showed the highest accuracy.
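The 16-class labelling (8 emotions, each split by speaker gender) can be illustrated with a small mapping. This is a sketch under assumptions: the emotion codes 0 to 7 come from the abstract, but the gender ordering, the class numbering scheme, and the helper names `class_id` and `class_name` are hypothetical.

```python
# Emotion codes 0-7 as listed in the abstract.
EMOTIONS = ["neutral", "calm", "happiness", "sadness",
            "anger", "fear", "disgust", "surprise"]
# Assumption: male classes come first; the paper's ordering is not given.
GENDERS = ["male", "female"]

def class_id(emotion_code, gender):
    """Map (emotion code, gender) to a single class index in 0..15."""
    if not 0 <= emotion_code < len(EMOTIONS):
        raise ValueError(f"unknown emotion code: {emotion_code}")
    return GENDERS.index(gender) * len(EMOTIONS) + emotion_code

def class_name(cid):
    """Inverse mapping, returning a readable label such as 'female_anger'."""
    gender_index, emotion_code = divmod(cid, len(EMOTIONS))
    return f"{GENDERS[gender_index]}_{EMOTIONS[emotion_code]}"

all_ids = {class_id(e, g) for g in GENDERS for e in range(len(EMOTIONS))}
```

With this scheme, `class_id(4, "female")` gives 12 and `class_name(12)` gives `"female_anger"`.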


Binary and Multiclass Text Classification by Means of Separable Convolutional Neural Network

Inventions ◽

10.3390/inventions6040070 ◽

2021 ◽

Vol 6 (4) ◽

pp. 70

Author(s):

Elena Solovyeva ◽

Ali Abdullah

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Network ◽

Recurrent Neural Networks ◽

Low Cost ◽

Computational Cost ◽

High Accuracy ◽

Activation Functions ◽

Fully Connected ◽

Fully Connected Networks

In this paper, the structure of a separable convolutional neural network consisting of an embedding layer, separable convolutional layers, a convolutional layer, and global average pooling is presented for binary and multiclass text classification. The advantage of the proposed structure is the absence of multiple fully connected layers, which are commonly used to increase classification accuracy but raise the computational cost. The combination of low-cost separable convolutional layers and a convolutional layer is proposed to achieve high accuracy while reducing the complexity of neural classifiers. The advantages are demonstrated on binary and multiclass classification of written texts with the proposed networks, using the sigmoid and Softmax activation functions in the convolutional layer. In both binary and multiclass classification, the accuracy obtained by separable convolutional neural networks is higher than that of some of the investigated types of recurrent neural networks and fully connected networks.
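The "low-cost" claim for separable convolutions can be made concrete by counting parameters. The sketch below assumes a depthwise separable 1-D convolution with a depth multiplier of 1 (one depthwise filter per input channel, followed by a 1x1 pointwise convolution). The layer sizes are illustrative and not taken from the paper, and the function names are hypothetical.

```python
def conv1d_params(kernel_size, in_channels, out_channels, bias=True):
    """Parameter count of a standard 1-D convolution."""
    return kernel_size * in_channels * out_channels + (out_channels if bias else 0)

def separable_conv1d_params(kernel_size, in_channels, out_channels, bias=True):
    """Parameter count of a depthwise separable 1-D convolution (depth multiplier 1)."""
    depthwise = kernel_size * in_channels   # one filter per input channel
    pointwise = in_channels * out_channels  # 1x1 convolution mixing channels
    return depthwise + pointwise + (out_channels if bias else 0)

# Illustrative layer: kernel width 5, 128 input and 128 output channels.
standard = conv1d_params(5, 128, 128)
separable = separable_conv1d_params(5, 128, 128)
ratio = standard / separable
```

For this layer the standard convolution has 82,048 parameters and the separable one 17,152, a reduction of roughly 4.8 times.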


Automatic Classification of Indian Languages into Tonal and Non-tonal Categories Using Cascade Convolutional Neural Network (CNN)-Long Short-Term Memory (LSTM) Recurrent Neural Networks

2018 International Conference on Signal Processing and Communications (SPCOM) ◽

10.1109/spcom.2018.8724471 ◽

2018 ◽

Author(s):

Chuya China ◽

Dipjyoti Bisharad ◽

Rabul Hussain Laskar

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Network ◽

Recurrent Neural Networks ◽

Short Term Memory ◽

Indian Languages ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory


Cascade convolutional neural network‐long short‐term memory recurrent neural networks for automatic tonal and nontonal preclassification‐based Indian language identification

Expert Systems ◽

10.1111/exsy.12544 ◽

2020 ◽

Vol 37 (5) ◽

Author(s):

Chuya China Bhanja ◽

Mohammad A. Laskar ◽

Rabul H. Laskar

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Network ◽

Recurrent Neural Networks ◽

Short Term Memory ◽

Language Identification ◽

Short Term ◽

Indian Language ◽

Term Memory ◽

Long Short Term Memory


Exploring Efficient Neural Architectures for Linguistic–Acoustic Mapping in Text-To-Speech

Applied Sciences ◽

10.3390/app9163391 ◽

2019 ◽

Vol 9 (16) ◽

pp. 3391 ◽

Cited By ~ 1

Author(s):

Santiago Pascual ◽

Joan Serrà ◽

Antonio Bonafonte

Keyword(s):

Neural Network ◽

Neural Networks ◽

Recurrent Neural Network ◽

Recurrent Neural Networks ◽

Affine Transformations ◽

Text To Speech ◽

Recursive Structure ◽

The One ◽

Acoustic Mapping ◽

Symbol Sequences

Conversion from text to speech relies on the accurate mapping from linguistic to acoustic symbol sequences, for which current practice employs recurrent statistical models such as recurrent neural networks. Despite the good performance of such models (in terms of low distortion in the generated speech), their recursive structure with intermediate affine transformations tends to make them slow to train and to sample from. In this work, we explore two different mechanisms that enhance the operational efficiency of recurrent neural networks, and study their performance–speed trade-off. The first mechanism is based on the quasi-recurrent neural network, where expensive affine transformations are removed from temporal connections and placed only on feed-forward computational directions. The second mechanism includes a module based on the transformer decoder network, designed without recurrent connections but emulating them with attention and positioning codes. Our results show that the proposed decoder networks are competitive in terms of distortion when compared to a recurrent baseline, whilst being significantly faster in terms of CPU and GPU inference time. The best performing model is the one based on the quasi-recurrent mechanism, reaching the same level of naturalness as the recurrent neural network based model with a speedup of 11.2 on CPU and 3.3 on GPU.
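The quasi-recurrent idea, affine transformations on the feed-forward path only and a purely element-wise recurrence, can be sketched in NumPy. This is an illustration under assumptions rather than the authors' model: it uses the commonly described "fo-pooling" recurrence, replaces the convolution over time with a width-1 filter (a per-timestep matrix product) for brevity, and all names and sizes are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def qrnn_layer(x, Wz, Wf, Wo):
    """Quasi-recurrent layer with fo-pooling.

    x: (T, d_in) input sequence; Wz, Wf, Wo: (d_in, d_hid) weights.
    Returns (T, d_hid) hidden states.
    """
    # Feed-forward part: every affine transform is computed for all
    # timesteps at once, so it parallelises across the sequence.
    z = np.tanh(x @ Wz)   # candidate values
    f = sigmoid(x @ Wf)   # forget gates
    o = sigmoid(x @ Wo)   # output gates

    # Recurrent part: element-wise operations only, no matrix product
    # on the temporal connection.
    c = np.zeros(Wz.shape[1])
    h = np.empty_like(z)
    for t in range(x.shape[0]):
        c = f[t] * c + (1.0 - f[t]) * z[t]
        h[t] = o[t] * c
    return h

rng = np.random.default_rng(0)
T, d_in, d_hid = 6, 4, 3  # illustrative sizes
x = rng.normal(size=(T, d_in))
Wz, Wf, Wo = (rng.normal(size=(d_in, d_hid)) for _ in range(3))
h = qrnn_layer(x, Wz, Wf, Wo)
```

Only the short loop is sequential. In a conventional RNN each step of that loop would also contain a hidden-to-hidden matrix product, which is the cost this design removes.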


Adaptive Capability of Recurrent Neural Networks with Fixed Weights for Series-Parallel System Identification

Neural Computation ◽

10.1162/neco.2009.06-07-542 ◽

2009 ◽

Vol 21 (11) ◽

pp. 3214-3227

Author(s):

James Ting-Ho Lo

Keyword(s):

Neural Network ◽

Dynamical System ◽

Neural Networks ◽

System Identification ◽

Recurrent Neural Network ◽

Recurrent Neural Networks ◽

Additional Input ◽

Adaptive Capability ◽

Environmental Process ◽

Optimal Series

By a fundamental neural filtering theorem, a recurrent neural network with fixed weights is known to be capable of adapting to an uncertain environment. This letter reports some mathematical results on the performance of such adaptation for series-parallel identification of a dynamical system as compared with the performance of the best series-parallel identifier possible under the assumption that the precise value of the uncertain environmental process is given. In short, if an uncertain environmental process is observable (not necessarily constant) from the output of a dynamical system or constant (not necessarily observable), then a recurrent neural network exists as a series-parallel identifier of the dynamical system whose output approaches the output of an optimal series-parallel identifier using the environmental process as an additional input.
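The term "series-parallel identification" can be illustrated with a toy example. In the series-parallel configuration the identifier predicts the next output from the measured past output of the system, whereas a parallel identifier feeds back its own past predictions. The sketch below uses a first-order linear plant and a deliberately mismatched linear model. It is not the fixed-weight recurrent network studied in the letter, and all names and coefficients are assumptions.

```python
def simulate_plant(a, b, inputs, y0=0.0):
    """Plant y[t+1] = a*y[t] + b*u[t]; returns outputs y[0..N]."""
    ys = [y0]
    for u in inputs:
        ys.append(a * ys[-1] + b * u)
    return ys

def series_parallel_predictions(a_hat, b_hat, inputs, measured):
    """Each prediction of y[t+1] uses the measured plant output y[t]."""
    return [a_hat * measured[t] + b_hat * inputs[t] for t in range(len(inputs))]

def parallel_predictions(a_hat, b_hat, inputs, y0=0.0):
    """Each prediction of y[t+1] uses the model's own previous prediction."""
    preds, prev = [], y0
    for u in inputs:
        prev = a_hat * prev + b_hat * u
        preds.append(prev)
    return preds

inputs = [1.0] * 100
measured = simulate_plant(0.9, 1.0, inputs)                   # true plant, a = 0.9
sp = series_parallel_predictions(0.8, 1.0, inputs, measured)  # model with a_hat = 0.8
par = parallel_predictions(0.8, 1.0, inputs)

sp_error = abs(measured[-1] - sp[-1])
par_error = abs(measured[-1] - par[-1])
```

With the same model mismatch, the series-parallel error settles near 1 while the parallel error settles near 5, because the parallel identifier compounds its own prediction errors over time.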


Relating the Slope of the Activation Function and the Learning Rate Within a Recurrent Neural Network

Neural Computation ◽

10.1162/089976699300016340 ◽

1999 ◽

Vol 11 (5) ◽

pp. 1069-1077 ◽

Cited By ~ 28

Author(s):

Danilo P. Mandic ◽

Jonathon A. Chambers

Keyword(s):

Neural Network ◽

Neural Networks ◽

Recurrent Neural Network ◽

Recurrent Neural Networks ◽

Degrees Of Freedom ◽

Learning Algorithm ◽

Activation Function ◽

Learning Rate ◽

Optimization Task ◽

Nonlinear Activation Function

A relationship between the learning rate η in the learning algorithm, and the slope β in the nonlinear activation function, for a class of recurrent neural networks (RNNs) trained by the real-time recurrent learning algorithm is provided. It is shown that an arbitrary RNN can be obtained via the referent RNN, with some deterministic rules imposed on its weights and the learning rate. Such relationships reduce the number of degrees of freedom when solving the nonlinear optimization task of finding the optimal RNN parameters.
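The kind of relationship described above can be checked numerically on a single recurrent neuron. The abstract does not state the rule itself, so the sketch assumes the commonly cited form: a network with slope β, learning rate η and weights w behaves like a referent network with slope 1, learning rate β²η and weights βw. The neuron model, training signal, and function names are all hypothetical.

```python
import math

def logistic(x, slope):
    return 1.0 / (1.0 + math.exp(-slope * x))

def train_rtrl(w, v, slope, lr, inputs, targets):
    """Real-time recurrent learning for y[t] = logistic(w*x[t] + v*y[t-1]).

    Returns the final weights (w, v).
    """
    y_prev, pw_prev, pv_prev = 0.0, 0.0, 0.0  # output and sensitivities dy/dw, dy/dv
    for x, d in zip(inputs, targets):
        y = logistic(w * x + v * y_prev, slope)
        dphi = slope * y * (1.0 - y)          # derivative w.r.t. the net input
        pw = dphi * (x + v * pw_prev)         # sensitivity recursion for w
        pv = dphi * (y_prev + v * pv_prev)    # sensitivity recursion for v
        e = d - y
        w += lr * e * pw
        v += lr * e * pv
        y_prev, pw_prev, pv_prev = y, pw, pv
    return w, v

inputs = [math.sin(0.3 * t) for t in range(50)]
targets = [0.5 + 0.4 * math.sin(0.3 * t + 0.5) for t in range(50)]

beta, eta = 2.0, 0.1
w0, v0 = 0.3, -0.2

# Network with slope beta and learning rate eta.
w, v = train_rtrl(w0, v0, beta, eta, inputs, targets)
# Referent network: slope 1, weights scaled by beta, learning rate beta^2 * eta.
w_ref, v_ref = train_rtrl(beta * w0, beta * v0, 1.0, beta**2 * eta, inputs, targets)
```

After training, `beta * w` matches `w_ref` and `beta * v` matches `v_ref` up to rounding, so β and η need not be tuned as independent parameters in this toy setting.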


Real‐Time Arrhythmia Detection Using Hybrid Convolutional Neural Networks

Journal of the American Heart Association ◽

10.1161/jaha.121.023222 ◽

2021 ◽

Author(s):

Sandeep Chandra Bollepalli ◽

Rahul K. Sevakula ◽

Wan‐Tai M. Au‐Yeung ◽

Mohamad B. Kassab ◽

Faisal M. Merchant ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Network ◽

Convolutional Neural Networks ◽

High Rate ◽

Superior Performance ◽

False Alarms ◽

Arrhythmia Detection ◽

Arterial Blood ◽

Life Threatening

Background: Accurate detection of arrhythmic events in intensive care units (ICUs) is of paramount significance in providing timely care. However, traditional ICU monitors generate a high rate of false alarms, causing alarm fatigue. In this work, we develop an algorithm to improve life-threatening arrhythmia detection in ICUs using a deep learning approach. Methods and Results: This study involves a total of 953 independent life-threatening arrhythmia alarms generated from the ICU bedside monitors of 410 patients. Specifically, we used the ECG (4 channels), arterial blood pressure, and photoplethysmograph signals to accurately detect the onset and offset of various arrhythmias, without prior knowledge of the alarm type. We used a hybrid convolutional neural network based classifier that fuses traditional handcrafted features with features automatically learned using convolutional neural networks. Further, the proposed architecture remains flexible enough to be adapted to various arrhythmic conditions as well as multiple physiological signals. Our hybrid convolutional neural network approach achieved superior performance compared with methods that used only a convolutional neural network. We evaluated our algorithm using 5-fold cross-validation repeated 5 times and obtained an accuracy of 87.5%±0.5% and a score of 81%±0.9%. Independent evaluation of our algorithm on the publicly available PhysioNet 2015 Challenge database resulted in an overall classification accuracy and score of 93.9% and 84.3%, respectively, indicating its efficacy and generalizability. Conclusions: Our method accurately detects multiple arrhythmic conditions. Suitable translation of our algorithm may significantly improve the quality of care in ICUs by reducing the burden of false alarms.
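The fusion of handcrafted and learned features can be sketched as a simple concatenation. This is a minimal NumPy illustration under assumptions: the three summary statistics are placeholders (the abstract does not list the study's handcrafted features), the "learned" branch is one untrained convolution with ReLU and global max pooling, and all names are hypothetical.

```python
import numpy as np

def handcrafted_features(signal):
    """Placeholder descriptors: mean, standard deviation, peak-to-peak range."""
    return np.array([signal.mean(), signal.std(), signal.max() - signal.min()])

def learned_features(signal, kernels):
    """One 1-D convolution per kernel, ReLU, then global max pooling.

    In a trained network the kernels would be learned; here they are random.
    """
    return np.array([
        np.maximum(np.convolve(signal, kernel, mode="valid"), 0.0).max()
        for kernel in kernels
    ])

def fused_features(signal, kernels):
    """Concatenate both feature groups into one vector for a classifier."""
    return np.concatenate([handcrafted_features(signal),
                           learned_features(signal, kernels)])

rng = np.random.default_rng(0)
signal = np.sin(np.linspace(0.0, 8.0 * np.pi, 256))  # synthetic stand-in for one channel
kernels = rng.normal(size=(4, 7))                    # 4 filters of width 7
features = fused_features(signal, kernels)
```

The fused vector has one entry per handcrafted descriptor plus one per convolutional filter, and would be passed to the final classification layers.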


Weighted Automata Extraction from Recurrent Neural Networks via Regression on State Spaces

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.5977 ◽

2020 ◽

Vol 34 (04) ◽

pp. 5306-5314

Author(s):

Takamasa Okudono ◽

Masaki Waga ◽

Taro Sekiyama ◽

Ichiro Hasuo

Keyword(s):

Neural Network ◽

Neural Networks ◽

Recurrent Neural Network ◽

Recurrent Neural Networks ◽

Learning Algorithm ◽

Internal State ◽

State Spaces ◽

Regression Methods ◽

Weighted Automata ◽

Equivalence Queries

We present a method to extract a weighted finite automaton (WFA) from a recurrent neural network (RNN). Our method is based on the WFA learning algorithm by Balle and Mohri, which is in turn an extension of Angluin's classic L* algorithm. Our technical novelty is in the use of regression methods for the so-called equivalence queries, thus exploiting the internal state space of an RNN to prioritize counterexample candidates. This way we achieve a quantitative/weighted extension of the recent work by Weiss, Goldberg and Yahav that extracts DFAs. We experimentally evaluate the accuracy, expressivity and efficiency of the extracted WFAs.
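For readers unfamiliar with weighted finite automata, the sketch below shows how a WFA assigns a number to a string: an initial weight vector is multiplied by one transition matrix per symbol and then by a final weight vector. It illustrates the object being extracted, not the extraction method. The example automaton (which counts occurrences of `a`) and the function name `wfa_value` are assumptions.

```python
def wfa_value(word, initial, transitions, final):
    """Compute initial^T * A[w1] * ... * A[wn] * final for the given word."""
    state = list(initial)
    for symbol in word:
        matrix = transitions[symbol]
        # Row vector times matrix: propagate weights through one symbol.
        state = [sum(state[i] * matrix[i][j] for i in range(len(state)))
                 for j in range(len(matrix[0]))]
    return sum(s * f for s, f in zip(state, final))

# Two-state WFA over {a, b} whose value is the number of 'a' symbols.
initial = [1.0, 0.0]
final = [0.0, 1.0]
transitions = {
    "a": [[1.0, 1.0],
          [0.0, 1.0]],
    "b": [[1.0, 0.0],
          [0.0, 1.0]],
}

count = wfa_value("abba", initial, transitions, final)
```

Here `count` is 2.0. A deterministic finite automaton only accepts or rejects a word, while the WFA returns a real value, which is why it can serve as a quantitative surrogate for an RNN.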


On Recurrent Neural Network Based Theorem Prover For First Order Minimal Logic

JUCS - Journal of Universal Computer Science ◽

10.3897/jucs.76563 ◽

2021 ◽

Vol 27 (11) ◽

pp. 1193-1202

Author(s):

Ashot Baghdasaryan ◽

Hovhannes Bolibekyan

Keyword(s):

Neural Network ◽

Neural Networks ◽

Recurrent Neural Network ◽

Theorem Proving ◽

Recurrent Neural Networks ◽

Selection Problem ◽

Theorem Prover ◽

Minimal Logic ◽

Free System ◽

First Order

There are three main problems for theorem proving with a standard cut-free system for first-order minimal logic. The first is the possibility of looping. The second is that the system might generate proofs that are permutations of each other. Finally, during the proof, choices must be made about which rules to apply and where to apply them. New systems with history mechanisms were introduced to solve the looping problem of automated theorem provers in first-order minimal logic. To solve the rule selection problem, recurrent neural networks are deployed to determine which formula from the context should be used in further steps. As a result, the time spent on theorem proving is reduced.
