Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition

The performance of a machine learning model depends on the quality of the features used as its input. Research into feature extraction methods for convolutional neural network (CNN)-based diagnostics for rotating machinery remains at a developmental stage. In general, the input to CNN-based diagnostics is a spectrogram that has undergone no significant pre-processing. This paper introduces octave-band filtering as a feature extraction method for pre-processing a spectrogram before it is passed to a CNN. The method is an adaptation of a feature extraction method originally developed for speech recognition. It differs from the filtering methods applied to speech recognition in its use of octave bands, which are weighted optimally for machinery diagnosis. The effectiveness of octave-band filtering is demonstrated through a case study. The method not only improves the accuracy of the CNN-based diagnostics but also reduces the size of the CNN.
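As a rough illustration of the idea, the sketch below pools the power in each spectrogram frame into octave-wide bands and applies a per-band weight. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the band edges (each band spans one doubling of frequency starting at `f_min`), and the uniform default weights are all hypothetical, and the abstract does not give the weighting it describes as optimal for machinery diagnosis.

```python
def octave_band_edges(f_min, f_max):
    """Return (low, high) edges of consecutive octave bands covering [f_min, f_max)."""
    if f_min <= 0 or f_max <= f_min:
        raise ValueError("require 0 < f_min < f_max")
    edges = []
    low = f_min
    while low < f_max:
        high = min(low * 2.0, f_max)  # one octave is a doubling of frequency
        edges.append((low, high))
        low *= 2.0
    return edges


def octave_band_filter(spectrogram, bin_freqs, f_min, f_max, weights=None):
    """Pool each spectrogram frame into weighted octave bands.

    spectrogram: list of frames, each a list of power values per frequency bin.
    bin_freqs:   center frequency (Hz) of each bin, same length as a frame.
    weights:     one weight per band; uniform by default (an assumption, since
                 the abstract does not state its weighting).
    """
    bands = octave_band_edges(f_min, f_max)
    if weights is None:
        weights = [1.0] * len(bands)
    if len(weights) != len(bands):
        raise ValueError("need exactly one weight per octave band")
    pooled = []
    for frame in spectrogram:
        row = []
        for (low, high), w in zip(bands, weights):
            # Sum the power of every bin whose frequency falls inside the band.
            total = sum(p for p, f in zip(frame, bin_freqs) if low <= f < high)
            row.append(w * total)
        pooled.append(row)
    return pooled


# Eight frequency bins collapse to four octave bands per frame.
bin_freqs = [100, 150, 200, 300, 400, 600, 800, 1200]
frame = [1.0] * 8
print(octave_band_filter([frame], bin_freqs, 100, 1600))
# → [[2.0, 2.0, 2.0, 2.0]]
```

In this toy case each frame shrinks from eight values to four, which is one way a filtering step of this kind could reduce the input size a CNN has to handle.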

Improving Myanmar Automatic Speech Recognition with Optimization of Convolutional Neural Network Parameters

International Journal on Natural Language Computing ◽

10.5121/ijnlc.2018.7601 ◽

2018 ◽

Vol 7 (6) ◽

pp. 1-10 ◽

Cited By ~ 1

Author(s):

Aye Nyein Mon ◽

Win Pa Pa ◽

Ye Kyaw Thu

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Convolutional Neural Network ◽

Automatic Speech Recognition ◽

Network Parameters

A Light-weight Convolutional Neural Network based Speech Recognition for Spoken Content Retrieval Task

2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC) ◽

10.1109/smc42975.2020.9282956 ◽

2020 ◽

Author(s):

Nirayo Hailu Gebreegziabher ◽

Andreas Nurnberger

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Convolutional Neural Network ◽

Light Weight ◽

Retrieval Task ◽

Content Retrieval

A High Accuracy Multiple-Command Speech Recognition ASIC Based on Configurable One-Dimension Convolutional Neural Network

2021 IEEE International Symposium on Circuits and Systems (ISCAS) ◽

10.1109/iscas51556.2021.9401401 ◽

2021 ◽

Author(s):

Lindong Wu ◽

Zongwei Wang ◽

Ming Zhao ◽

Wei Hu ◽

Yimao Cai ◽

...

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Convolutional Neural Network ◽

High Accuracy ◽

One Dimension

Deep convolutional neural networks for human movement detection using wireless signals

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189629 ◽

2021 ◽

pp. 1-10

Author(s):

Chien-Cheng Lee ◽

Zhongjian Gao ◽

Xiu-Chi Huang

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Convolutional Neural Network ◽

Detection System ◽

Deep Convolutional Neural Network ◽

Human Detection ◽

Two Dimensional ◽

Dimensional Matrix ◽

State Classification ◽

Propagation Paths

This paper proposes a Wi-Fi-based indoor human detection system using a deep convolutional neural network. The system detects different human states in various situations, including different environments and propagation paths. Its main improvement is that it requires neither overhead cameras nor mounted sensors. The system captures useful amplitude information from the channel state information and converts it into an image-like two-dimensional matrix. This matrix is then used as the input to a deep convolutional neural network (CNN) that distinguishes human states. In this work, a deep residual network (ResNet) architecture performs human state classification with hierarchical topological feature extraction. Several combinations of datasets for different environments and propagation paths are used in the study. ResNet's powerful inference simplifies feature extraction and improves the accuracy of human state classification. The experimental results show that the fine-tuned ResNet-18 model performs well in indoor human detection across three states: no person present, a person standing still, and a person moving. Compared with traditional machine learning using handcrafted features, this method is simple and effective.
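The conversion from channel state information to an image-like matrix can be sketched as follows. This is a minimal sketch under assumptions the abstract does not confirm: the function name `csi_to_image` is hypothetical, the matrix is laid out as subcarriers by packets, and the amplitudes are min-max scaled to [0, 1] so they resemble pixel intensities.

```python
def csi_to_image(csi_packets):
    """Turn complex CSI samples into a 2D amplitude matrix scaled to [0, 1].

    csi_packets: list of packets over time, each a list of complex CSI values,
                 one per subcarrier.
    Returns a matrix indexed as [subcarrier][packet]. This layout and the
    min-max scaling are assumptions, not details taken from the paper.
    """
    if not csi_packets:
        raise ValueError("need at least one CSI packet")
    n_sub = len(csi_packets[0])
    # Amplitude is the magnitude of each complex channel coefficient.
    amplitudes = [[abs(packet[s]) for packet in csi_packets] for s in range(n_sub)]
    flat = [a for row in amplitudes for a in row]
    lowest, highest = min(flat), max(flat)
    span = highest - lowest
    if span == 0:
        return [[0.0 for _ in row] for row in amplitudes]
    return [[(a - lowest) / span for a in row] for row in amplitudes]


# Two packets, two subcarriers.
packets = [[3 + 4j, 0j], [6 + 8j, 0j]]
print(csi_to_image(packets))
# → [[0.5, 1.0], [0.0, 0.0]]
```

A matrix in this form could then be resized and fed to an image classifier such as a ResNet, which is the role the abstract assigns to ResNet-18.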

Isolated Word Speech Recognition Using Convolutional Neural Network

2020 International Conference on Computer, Control, Electrical, and Electronics Engineering (ICCCEEE) ◽

10.1109/iccceee49695.2021.9429684 ◽

2021 ◽

Author(s):

Aljenan Soliman ◽

Salah Mohamed ◽

Iman Abuelmaaly Abdelrahman

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Convolutional Neural Network ◽

Isolated Word

A Study of Spatial-Spectral Feature Extraction frameworks with 3D Convolutional Neural Network for Robust Hyperspectral Imagery Classification

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing ◽

10.1109/jstars.2020.3046414 ◽

2020 ◽

pp. 1-1

Author(s):

Bishwas Praveen ◽

Vineetha Menon

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Convolutional Neural Network ◽

Hyperspectral Imagery ◽

Spectral Feature

AUTOMATED SCREENING OF DIABETIC RETINOPATHY WITH OPTIMIZED DEEP CONVOLUTIONAL NEURAL NETWORK: ENHANCED MOTH FLAME MODEL

Journal of Mechanics in Medicine and Biology ◽

10.1142/s0219519421500056 ◽

2021 ◽

Vol 21 (01) ◽

pp. 2150005

Author(s):

ARUN T NAIR ◽

K. MUTHUVEL

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Blood Vessel ◽

Convolutional Neural Network ◽

Optimization Algorithm ◽

Vessel Segmentation ◽

Deep Convolutional Neural Network ◽

Gray Level ◽

Pass Filter ◽

Blood Vessel Segmentation

Retinal image analysis is a challenging area of study. Numerous retinal diseases can be recognized by analyzing the variations that take place in the retina. However, the main disadvantage of existing studies is that they do not achieve high recognition accuracy. The proposed framework includes four phases: (i) Blood Vessel Segmentation, (ii) Feature Extraction, (iii) Optimal Feature Selection, and (iv) Classification. Initially, the input fundus image is subjected to blood vessel segmentation, from which two binary thresholded images (one from a High Pass Filter (HPF) and the other from top-hat reconstruction) are acquired. These two images are differentiated: the areas common to both are taken as the major vessels, and the leftover regions are fused to form the vessel sub-image. The vessel sub-images are classified with a Gaussian Mixture Model (GMM) classifier, and the result is combined with the major vessels to form the segmented blood vessels. The segmented images are then subjected to feature extraction, where features such as the proposed Local Binary Pattern (LBP), Gray-Level Co-Occurrence Matrix (GLCM), and Gray Level Run Length Matrix (GLRM) are extracted. Because the curse of dimensionality is the greatest issue, it is important to select the appropriate features from the extracted set for classification. In this paper, a new improved optimization algorithm, Moth Flame with New Distance Formulation (MF-NDF), is introduced for selecting the optimal features. Finally, the selected optimal features are passed to a Deep Convolutional Neural Network (DCNN) model for classification. Further, to make a precise diagnosis, the weights of the DCNN are optimally tuned by the same optimization algorithm. The performance of the proposed algorithm is compared against conventional algorithms in terms of positive and negative measures.
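Of the texture features named above, the gray-level co-occurrence matrix is the simplest to show in a few lines. The sketch below is an illustration under assumptions, not the authors' pipeline: the function names, the single horizontal offset, and the choice of contrast as the summary statistic are all hypothetical, since the abstract does not state which offsets or GLCM statistics were used.

```python
def glcm(image, levels, dx=1, dy=0):
    """Count how often gray level i occurs next to gray level j.

    image:  2D list of integer gray levels in range(levels).
    dx, dy: pixel offset to the neighbor (default: one pixel to the right).
    """
    rows, cols = len(image), len(image[0])
    counts = [[0] * levels for _ in range(levels)]
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[image[r][c]][image[r2][c2]] += 1
    return counts


def glcm_contrast(counts):
    """Contrast: mean squared gray-level difference over all counted pairs."""
    total = sum(sum(row) for row in counts)
    if total == 0:
        return 0.0
    weighted = sum(
        (i - j) ** 2 * n
        for i, row in enumerate(counts)
        for j, n in enumerate(row)
    )
    return weighted / total


image = [
    [0, 0, 1],
    [1, 2, 2],
]
matrix = glcm(image, levels=3)
print(matrix)                 # → [[1, 1, 0], [0, 0, 1], [0, 0, 1]]
print(glcm_contrast(matrix))  # → 0.5
```

Statistics like this one yield a short feature vector per image, which is the kind of input a feature selection step would then prune before classification.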

Joint Feature Extraction for Multispectral and Panchromatic Images Based on Convolutional Neural Network

IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss.2018.8518885 ◽

2018 ◽

Author(s):

Yi Chen ◽

Mengmeng Zhang ◽

Wei Li ◽

Qian Du

Keyword(s):

Neural Network ◽

Feature Extraction ◽

Convolutional Neural Network

Convolutional Neural Network for Automatic Speech Recognition of Filipino Language

Octave-band Filtering for Convolutional Neural Network-based Diagnostics for Rotating Machinery
