Research on End-to-end Voiceprint Recognition Model Based on Convolutional Neural Network

Speech is a time-varying signal that is strongly affected by the individual speaker and the environment. To improve the end-to-end voiceprint recognition rate, the original speech signal must first be preprocessed. This paper proposes an end-to-end voiceprint recognition algorithm based on a convolutional neural network. The algorithm uses the convolution and down-sampling operations of the convolutional neural network to preprocess the speech signals. One-dimensional and two-dimensional convolution operations are used to extract Mel-frequency cepstral coefficient (MFCC) feature parameters from the preprocessed signals, and the classical universal background model is used to build the voiceprint recognition model. The study first analyzes the principle of end-to-end voiceprint recognition and examines the recognition process, the recognition features, and the Res-FD-CNN network structure. It then constructs the convolutional neural network recognition model, preprocesses the data to form the frequency-domain convolutional layer, and tests the algorithm.
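
The convolution and down-sampling steps mentioned in the abstract can be illustrated with a minimal sketch. The sample values, the two-tap kernel, and the function names `conv1d` and `max_pool1d` are assumptions made for illustration; the abstract does not specify the filters or pooling used by the authors, and max pooling is only one common choice of down-sampling.

```python
def conv1d(signal, kernel):
    """Valid-mode 1-D convolution (cross-correlation, as in most CNN libraries)."""
    k = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(k))
        for i in range(len(signal) - k + 1)
    ]


def max_pool1d(values, size):
    """Down-sample by keeping the maximum of each non-overlapping window."""
    return [max(values[i:i + size]) for i in range(0, len(values) - size + 1, size)]


# Hypothetical toy samples standing in for a speech frame.
samples = [0.0, 1.0, 3.0, 2.0, 0.0, -1.0, -2.0, 0.0, 1.0]
# Hand-picked difference filter; a trained network would learn its own weights.
kernel = [-1.0, 1.0]

features = conv1d(samples, kernel)
pooled = max_pool1d(features, size=2)
print(features)
print(pooled)
```

With these toy values the convolution yields eight feature values and the pooling halves that to four, which is the sense in which down-sampling shortens the representation passed to later layers.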

End-to-End Speech Emotion Recognition Based on One-Dimensional Convolutional Neural Network

Proceedings of the 2019 3rd International Conference on Innovation in Artificial Intelligence (ICIAI 2019), 2019. DOI: 10.1145/3319921.3319963. Cited by ~1.

Author(s): Mengna Gao, Jing Dong, Dongsheng Zhou, Qiang Zhang, Deyun Yang

Keyword(s): Neural Network, Convolutional Neural Network, Emotion Recognition, Speech Emotion Recognition, One Dimensional, End To End

Fault Detection of Reciprocating Compressor Valve Based on One-Dimensional Convolutional Neural Network

Mathematical Problems in Engineering, 2020, Vol 2020, pp. 1-10. DOI: 10.1155/2020/8058723.

Author(s): Fu-yan Guo, Yan-chao Zhang, Yue Wang, Ping Wang, Pei-jun Ren, ...

Keyword(s): Neural Network, Convolutional Neural Network, Oil And Gas, Recognition Rate, Reciprocating Compressor, One Dimensional, Healthy Development, Fault Recognition, Diagnosis Model, Artificial Neural Network (ANN)

Reciprocating compressors are important equipment in the oil and gas industries and are closely tied to the healthy development of an enterprise. Detecting valve faults is essential because valve failures account for 60% of total failures. Artificial neural networks (ANNs) are widely used in this field, but a complex network is unsuitable because of its low accuracy and tendency to overfit. This paper proposes a fault diagnosis model for reciprocating compressor valves based on a one-dimensional convolutional neural network (1DCNN). The method takes the differential pressure and differential temperature of each compressor stage as the input of the 1DCNN, uses the CNN to extract features, and finally uses Softmax to classify the fault. To verify the method, it is compared with LM-BP, RBF, and BP neural networks. The results show that the fault recognition rate of the 1DCNN reaches 100%, which demonstrates the effectiveness and feasibility of the proposed method.
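
The final Softmax classification step can be sketched as follows. The fault labels and the raw scores are hypothetical placeholders; in the described method the scores would come from the last layer of the 1DCNN, whose architecture is not given in the abstract.

```python
import math


def softmax(scores):
    """Convert raw scores to probabilities that sum to 1."""
    shift = max(scores)  # subtracting the maximum keeps exp() numerically stable
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(scores, labels):
    """Return the most probable label together with its probability."""
    probs = softmax(scores)
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best], probs[best]


# Hypothetical fault classes and network outputs, for illustration only.
labels = ["normal", "suction valve fault", "discharge valve fault"]
scores = [0.5, 2.0, -1.0]

label, prob = classify(scores, labels)
print(label, round(prob, 3))
```

The class with the largest raw score always receives the largest probability, so Softmax changes the scale of the outputs but not the predicted class.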

Human Activity Recognition Algorithm Based on One-Dimensional Convolutional Neural Network

Revue d'Intelligence Artificielle, 2020, Vol 34 (1), pp. 75-80. DOI: 10.18280/ria.340110.

Author(s): Dengping Tang, Miao Jin, Quan Wang, Wei Zhou, Jun Zhang

Keyword(s): Neural Network, Convolutional Neural Network, Activity Recognition, Human Activity, Human Activity Recognition, Recognition Algorithm, One Dimensional

Electrocardiogram Classification Based on Faster Regions with Convolutional Neural Network

Sensors, 2019, Vol 19 (11), p. 2558. DOI: 10.3390/s19112558. Cited by ~13.

Author(s): Yinsheng Ji, Sen Zhang, Wendong Xiao

Keyword(s): Neural Network, Convolutional Neural Network, System Development, Support Vector, One Dimensional, ECG Signals, Two Dimensional Image, Average Accuracy, Effective System, The One

The classification of electrocardiograms (ECG) plays an important role in the clinical diagnosis of heart disease. This paper develops and implements an ECG classification system based on the faster regions with convolutional neural network (Faster R-CNN) algorithm. The one-dimensional ECG signals used in this experiment comprise preprocessed patient ECG signals and some ECG recordings from the MIT-BIH database. Each beat of the one-dimensional ECG signals was transformed into a two-dimensional image to build the training and test sets. The ECG beats were classified into five categories with an average accuracy of 99.21%. In a comparative experiment using the one-versus-rest support vector machine (OVR SVM) algorithm, the classification accuracy of the proposed Faster R-CNN was shown to be 2.59% higher.
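
One way to turn a one-dimensional beat into a two-dimensional image is to rasterise the waveform, as in the sketch below. This is an assumption for illustration: the abstract does not say which transformation the authors used, and the function name `beat_to_image`, the image height, and the sample values are hypothetical.

```python
def beat_to_image(beat, height):
    """Rasterise a 1-D beat into a binary image with `height` rows and len(beat) columns."""
    lo, hi = min(beat), max(beat)
    span = (hi - lo) or 1.0  # fall back to 1.0 so a flat beat does not divide by zero
    image = [[0] * len(beat) for _ in range(height)]
    for col, value in enumerate(beat):
        # Row 0 is the top of the image, so larger amplitudes map to smaller row indices.
        row = round((hi - value) / span * (height - 1))
        image[row][col] = 1
    return image


# Hypothetical five-sample beat with a peak followed by a dip.
beat = [0.0, 0.1, 1.0, -0.2, 0.0]
image = beat_to_image(beat, height=4)

for row in image:
    print("".join("#" if pixel else "." for pixel in row))
```

Each column of the result contains exactly one lit pixel, so the image is a coarse plot of the beat that a two-dimensional detector can consume.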

Classification Modeling Method for Near-Infrared Spectroscopy of Tobacco Based on Multimodal Convolution Neural Networks

Journal of Analytical Methods in Chemistry, 2020, Vol 2020, pp. 1-13. DOI: 10.1155/2020/9652470. Cited by ~2.

Author(s): Lei Zhang, Xiangqian Ding, Ruichun Hou

Keyword(s): Neural Network, Neural Networks, Infrared Spectroscopy, Convolutional Neural Network, Near Infrared Spectroscopy, Near Infrared, Two Dimensional, One Dimensional, NIRS Data, The One

The origin of tobacco is the most important factor in determining its style characteristics and intrinsic quality, and near-infrared spectroscopy is widely applied to identify tobacco origin. To improve the accuracy of tobacco origin classification, a near-infrared spectrum (NIRS) identification method based on multimodal convolutional neural networks (CNN) is proposed, taking advantage of the strong feature extraction ability of the CNN. First, a one-dimensional convolutional neural network (1-D CNN) is used to extract and combine the pattern features of the one-dimensional NIRS data, and the extracted features are then used for classification. Second, the one-dimensional NIRS data are converted into two-dimensional spectral images, and structure features are extracted from these images by a two-dimensional convolutional neural network (2-D CNN). Classification is performed by combining global and local training features. Finally, the influence of different network structure parameters on model identification performance is studied, and the optimal CNN models are selected and compared. The multimodal NIR-CNN identification models of tobacco origin were established using the NIRS of 5,200 tobacco samples from 10 major tobacco-producing provinces in China and 3 foreign countries. The classification accuracy of the 1-D CNN and 2-D CNN models was 93.15% and 93.05%, respectively, which was better than the traditional PLS-DA method. The experimental results show that the 1-D CNN and 2-D CNN can distinguish the NIRS data accurately and reliably, and that the approach can be developed into a new rapid identification method for tobacco origin that is worth promoting.
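
The conversion of a one-dimensional spectrum into a two-dimensional image can be sketched by reshaping the values row by row into a square grid. This is only one possible conversion and is an assumption here, since the abstract does not describe how the spectral images were built; the function name `spectrum_to_square`, the zero padding, and the sample values are hypothetical.

```python
import math


def spectrum_to_square(spectrum, pad_value=0.0):
    """Reshape a 1-D spectrum row by row into the smallest square grid that holds it."""
    n = len(spectrum)
    side = math.isqrt(n)
    if side * side < n:
        side += 1  # round the side length up when n is not a perfect square
    padded = list(spectrum) + [pad_value] * (side * side - n)
    return [padded[r * side:(r + 1) * side] for r in range(side)]


# Hypothetical seven-point absorbance spectrum.
spectrum = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7]
image = spectrum_to_square(spectrum)

for row in image:
    print(row)
```

The seven values fill a 3 x 3 grid with two padded cells, after which a two-dimensional convolution can see neighbouring wavelengths both along a row and across adjacent rows.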

Study of Human Motion Recognition Algorithm Based on Multichannel 3D Convolutional Neural Network

Complexity, 2021, Vol 2021, pp. 1-12. DOI: 10.1155/2021/7646813.

Author(s): Yang Ju

Keyword(s): Neural Network, Convolutional Neural Network, Optical Flow, Recognition Rate, Linear Representation, Random Projection, Recognition Algorithm, Human Motion, Motion Recognition, Human Motion Recognition

To address the difficulty of balancing speed and accuracy in human behaviour recognition, this paper proposes a motion recognition method based on random projection. First, the optical flow images obtained by the Lucas-Kanade algorithm and the red, green, blue (RGB) images are used. Second, the optical flow and RGB image data are compressed with a random projection matrix from compressed sensing, which effectively reduces power consumption. The randomly projected data also make it possible to find the optimal linear representation for reconstructing training and test samples. Third, a multichannel 3D convolutional neural network is proposed, and the multiple kinds of information extracted by the network are fused to form an output recognizer. Experimental results show that the algorithm significantly improves the recognition rate for multicategory actions and effectively reduces the computational complexity and running time of the recognition algorithm.
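
The compression step can be sketched with a random projection that maps a long vector to a shorter one. The Gaussian matrix, its 1/sqrt(k) scaling, the dimensions, and the function names are assumptions for illustration; the abstract states only that a random projection matrix from compressed sensing is used.

```python
import random


def random_projection_matrix(k, d, seed=0):
    """Build a k x d matrix of Gaussian entries scaled by 1/sqrt(k)."""
    rng = random.Random(seed)
    scale = 1.0 / k ** 0.5
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(d)] for _ in range(k)]


def project(matrix, vector):
    """Multiply the projection matrix by a vector to get its compressed form."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


# Hypothetical sizes: a flattened 64-value image patch compressed to 16 values.
d, k = 64, 16
pixel_rng = random.Random(1)
frame = [pixel_rng.random() for _ in range(d)]

matrix = random_projection_matrix(k, d)
compressed = project(matrix, frame)
print(len(frame), "->", len(compressed))
```

Because the projection is a single matrix multiplication, the same matrix can be applied to optical flow data and RGB data alike, and the reduced dimension lowers the cost of every later stage.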

Dual-channel convolutional neural network for power edge image recognition

Journal of Cloud Computing: Advances, Systems and Applications, 2021, Vol 10 (1). DOI: 10.1186/s13677-021-00235-9.

Author(s): Fangrong Zhou, Yi Ma, Bo Wang, Gang Lin

Keyword(s): Neural Network, Convolutional Neural Network, Image Recognition, Recognition Rate, Recognition Algorithm, Classification Method, Power Equipment, Recognition Method, Edge Image, Dual Channel

In view of the low accuracy and poor processing capacity of traditional power equipment image recognition methods, this paper proposes a power equipment image recognition method based on a dual-channel convolutional neural network (DC-CNN) model and random forest (RF) classification. For feature extraction, the DC-CNN model extracts the characteristics of power equipment through two independent CNN models. For recognition, an RF classification method incorporating deep learning is proposed, drawing on the advantages of traditional machine learning and of the RF. Finally, the proposed DC-CNN model and RF classification method are used to classify images of various types of power equipment. The results show that the proposed methods can be effectively applied to image recognition for various types of power equipment and that they greatly improve the recognition rate of power equipment images.
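
The overall data flow, two independent feature channels whose outputs are combined and passed to a voting ensemble, can be sketched as below. Everything concrete here is a hypothetical stand-in: `channel_a` and `channel_b` replace the two CNNs with simple row and column averages, the three hand-written rules replace trained decision trees, and the class names are invented for illustration.

```python
from collections import Counter


def channel_a(image):
    """Stand-in for the first CNN channel: mean intensity of each row."""
    return [sum(row) / len(row) for row in image]


def channel_b(image):
    """Stand-in for the second CNN channel: mean intensity of each column."""
    return [sum(col) / len(col) for col in zip(*image)]


def fuse(image):
    """Concatenate the feature vectors produced by the two channels."""
    return channel_a(image) + channel_b(image)


# Hand-written threshold rules standing in for trained decision trees.
trees = [
    lambda f: "insulator" if f[0] > 0.5 else "transformer",
    lambda f: "insulator" if f[2] > 0.5 else "transformer",
    lambda f: "insulator" if f[1] < 0.5 else "transformer",
]


def forest_predict(features):
    """Return the class chosen by the majority of the trees."""
    votes = Counter(tree(features) for tree in trees)
    return votes.most_common(1)[0][0]


# Hypothetical 2 x 2 image: bright top row, dark bottom row.
image = [[1, 1], [0, 0]]
features = fuse(image)
print(features, forest_predict(features))
```

Here two of the three rules vote for the same class, so the ensemble returns it even though one rule disagrees, which is the majority-voting behaviour a random forest relies on.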
