Research on advertising content recognition based on convolutional neural network and recurrent neural network

Plant phenotypic image recognition (PPIR) is an important branch of smart agriculture. In recent years, deep learning has achieved significant breakthroughs in image recognition. Consequently, PPIR technology that is based on deep learning is becoming increasingly popular. First, this paper introduces the development and application of PPIR technology, followed by its classification and analysis. Second, it presents the theory of four types of deep learning methods and their applications in PPIR. These methods include the convolutional neural network, deep belief network, recurrent neural network, and stacked autoencoder, and they are applied to identify plant species, diagnose plant diseases, etc. Finally, the difficulties and challenges of deep learning in PPIR are discussed.

Download Full-text

EMOTIONS RECOGNITION IN HUMAN SPEECH USING DEEP NEURAL NETWORKS

Vestnik komp iuternykh i informatsionnykh tekhnologii ◽

10.14489/vkit.2021.01.pp.044-051 ◽

2021 ◽

pp. 44-51

Author(s):

E. Yu. Shchetinin

Keyword(s):

Neural Network ◽

Machine Learning ◽

Neural Networks ◽

Convolutional Neural Network ◽

Recurrent Neural Network ◽

Deep Neural Networks ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Audio Recordings ◽

Computer Studies

The recognition of human emotions is one of the most relevant and dynamically developing areas of modern speech technologies, and the recognition of emotions in speech (RER) is the most demanded part of them. In this paper, we propose a computer model of emotion recognition based on an ensemble of bidirectional recurrent neural network with LSTM memory cell and deep convolutional neural network ResNet18. In this paper, computer studies of the RAVDESS database containing emotional speech of a person are carried out. RAVDESS-a data set containing 7356 files. Entries contain the following emotions: 0 – neutral, 1 – calm, 2 – happiness, 3 – sadness, 4 – anger, 5 – fear, 6 – disgust, 7 – surprise. In total, the database contains 16 classes (8 emotions divided into male and female) for a total of 1440 samples (speech only). To train machine learning algorithms and deep neural networks to recognize emotions, existing audio recordings must be pre-processed in such a way as to extract the main characteristic features of certain emotions. This was done using Mel-frequency cepstral coefficients, chroma coefficients, as well as the characteristics of the frequency spectrum of audio recordings. In this paper, computer studies of various models of neural networks for emotion recognition are carried out on the example of the data described above. In addition, machine learning algorithms were used for comparative analysis. Thus, the following models were trained during the experiments: logistic regression (LR), classifier based on the support vector machine (SVM), decision tree (DT), random forest (RF), gradient boosting over trees – XGBoost, convolutional neural network CNN, recurrent neural network RNN (ResNet18), as well as an ensemble of convolutional and recurrent networks Stacked CNN-RNN. The results show that neural networks showed much higher accuracy in recognizing and classifying emotions than the machine learning algorithms used. Of the three neural network models presented, the CNN + BLSTM ensemble showed higher accuracy.

Download Full-text

Comparative effectiveness of convolutional neural network (CNN) and recurrent neural network (RNN) architectures for radiology text report classification

Artificial Intelligence in Medicine ◽

10.1016/j.artmed.2018.11.004 ◽

2019 ◽

Vol 97 ◽

pp. 79-88 ◽

Cited By ~ 30

Author(s):

Imon Banerjee ◽

Yuan Ling ◽

Matthew C. Chen ◽

Sadid A. Hasan ◽

Curtis P. Langlotz ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Recurrent Neural Network ◽

Comparative Effectiveness ◽

Text Report

Download Full-text

Improving Bug Localization with Character-Level Convolutional Neural Network and Recurrent Neural Network

2018 25th Asia-Pacific Software Engineering Conference (APSEC) ◽

10.1109/apsec.2018.00097 ◽

2018 ◽

Cited By ~ 1

Author(s):

Yan Xiao ◽

Jacky Keung

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Recurrent Neural Network ◽

Bug Localization

Download Full-text

Real-time event detection using recurrent neural network in social sensors

International Journal of Distributed Sensor Networks ◽

10.1177/1550147719856492 ◽

2019 ◽

Vol 15 (6) ◽

pp. 155014771985649 ◽

Cited By ~ 2

Author(s):

Van Quan Nguyen ◽

Tien Nguyen Anh ◽

Hyung-Jeong Yang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Language Processing ◽

Recurrent Neural Network ◽

Event Detection ◽

Word Embedding ◽

Series Data ◽

Accuracy Score ◽

Data Set ◽

Size Limitation

We proposed an approach for temporal event detection using deep learning and multi-embedding on a set of text data from social media. First, a convolutional neural network augmented with multiple word-embedding architectures is used as a text classifier for the pre-processing of the input textual data. Second, an event detection model using a recurrent neural network is employed to learn time series data features by extracting temporal information. Recently, convolutional neural networks have been used in natural language processing problems and have obtained excellent results as performing on available embedding vector. In this article, word-embedding features at the embedding layer are combined and fed to convolutional neural network. The proposed method shows no size limitation, supplementation of more embeddings than standard multichannel based approaches, and obtained similar performance (accuracy score) on some benchmark data sets, especially in an imbalanced data set. For event detection, a long short-term memory network is used as a predictor that learns higher level temporal features so as to predict future values. An error distribution estimation model is built to calculate the anomaly score of observation. Events are detected using a window-based method on the anomaly scores.

Download Full-text

Faster Region-Convolutional Neural network oriented feature learning with optimal trained Recurrent Neural Network for bone age assessment for pediatrics

Biomedical Signal Processing and Control ◽

10.1016/j.bspc.2021.103016 ◽

2022 ◽

Vol 71 ◽

pp. 103016

Author(s):

Sonal Deshmukh ◽

Arti Khaparde

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Recurrent Neural Network ◽

Feature Learning ◽

Bone Age ◽

Age Assessment ◽

Bone Age Assessment

Download Full-text

Classification of Analyzed Text in Speech Recognition Using RNN-LSTM in Comparison with Convolutional Neural Network to Improve Precision for Identification of Keywords

Revista Gestão Inovação e Tecnologias ◽

10.47059/revistageintec.v11i2.1739 ◽

2021 ◽

Vol 11 (2) ◽

pp. 1097-1108

Author(s):

Bathaloori Reddy Prasad

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Convolutional Neural Network ◽

Recurrent Neural Network ◽

Short Term Memory ◽

Statistical Significance ◽

Language Translation ◽

Accuracy And Precision ◽

Long Short Term Memory

Aim: Text classification is a method to classify the features from language translation in speech recognition from English to Telugu using a recurrent neural network- long short term memory (RNN-LSTM) comparison with convolutional neural network (CNN). Materials and Methods: Accuracy and precision are performed with dataset alexa and english-telugu of size 8166 sentences. Classification of language translation is performed by the recurrent neural network where a number of the samples (N=62) and convolutional neural network were a number of samples (N=62) techniques, the algorithm RNN implies speech recognition that can be compared with convolutional is the second technique. Results and Discussion: RNN-LSTM from the dataset speech recognition, feature Telugu_id produce accuracy 93% and precision 68.04% which can be comparatively higher than CNN accuracy 66.11%, precision 61.90%. It shows a statistical significance as 0.007 from Independent Sample T-test. Conclusion: The RNN-LSTM performs better in finding accuracy and precision when compared to CNN.

Download Full-text

Flood Detection from Satellite Images Based on Deep Convolutional Neural Network and Layered Recurrent Neural Network

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.e3144.039520 ◽

2020 ◽

Vol 9 (5) ◽

pp. 2041-2045

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Recurrent Neural Network ◽

Satellite Images ◽

Satellite Image ◽

High Accuracy ◽

Training Phase ◽

Training Set ◽

Flood Detection ◽

And Training

Satellite images are important for developing and protected environmental resources that can be used for flood detection. The satellite image of before-flooding and after-flooding to be segmented and feature with integration of deeply LRNN and CNN networks for giving high accuracy. It is also important for learning LRNN and CNN is able to find the feature of flooding regions sufficiently and, it will influence the effectiveness of flood relief. The CNNs and LRNNs consists of two set are training set and testing set. The before flooding and after flooding of satellite images to be extract and segment formed by testing and training phase of data patches. All patches are trained by LRNN where changes occur or any misdetection of flooded region to extract accurately without delay. This proposed method obtain accuracy of system is 99% of flood region detections.

Download Full-text