Effective Technique for Noise Removal and Emotion Recognition in Speech Signals Using Cat Swarm Optimized Spiking Neural Networks

Author(s):  
C. Revathy ◽  
R. Sureshbabu

Speech processing is one of the required fields in digital signal processing that helps in processing the speech signals. The speech process is utilized in different fields such as emotion recognition, virtual assistants, voice identification, etc. Among the various applications, emotion recognition is one of the critical areas because it is used to recognize people’s exact emotions and eliminate physiological issues. Several researchers utilize signal processing and machine learning techniques together to find the exact human emotions. However, they fail to attain their feelings with less computational complexity and high accuracy. This paper introduces the intelligent computational technique called cat swarm optimized spiking neural network (CSSPNN). Initially, the emotional speech signal is collected from the Toronto emotional speech set (TESS) dataset, which is then processed by applying a wavelet approach to extract the features. The derived features are further examined using the defined classifier CSSPNN, which recognizes human emotions due to the effective training and learning process. Finally, the proficiency of the system is determined using experimental results and discussions. The proposed system recognizes the speech emotions up to 99.3% accuracy compared to recurrent neural networks (RNNs), deep neural networks (DNNs) and deep shallow neural networks (DSNNs).

2011 ◽  
Vol 121-126 ◽  
pp. 815-819 ◽  
Author(s):  
Yu Qiang Qin ◽  
Xue Ying Zhang

Ensemble empirical mode decomposition(EEMD) is a newly developed method aimed at eliminating mode mixing present in the original empirical mode decomposition (EMD). To evaluate the performance of this new method, this paper investigates the effect of two parameters pertinent to EEMD: the emotional envelop and the number of emotional ensemble trials. At the same time, the proposed technique has been utilized for four kinds of emotional(angry、happy、sad and neutral) speech signals, and compute the number of each emotional ensemble trials. We obtain an emotional envelope by transforming the IMFe of emotional speech signals, and obtain a new method of emotion recognition according to different emotional envelop and emotional ensemble trials.


2020 ◽  
Vol 17 (8) ◽  
pp. 3786-3789
Author(s):  
P. Gayathri ◽  
P. Gowri Priya ◽  
L. Sravani ◽  
Sandra Johnson ◽  
Visanth Sampath

Recognition of emotions is the aspect of speech recognition that is gaining more attention and the need for it is growing enormously. Although there are methods to identify emotion using machine learning techniques, we assume in this paper that calculating deltas and delta-deltas for customized features not only preserves effective emotional information, but also that the impact of irrelevant emotional factors, leading to a reduction in misclassification. Furthermore, Speech Emotion Recognition (SER) often suffers from the silent frames and irrelevant emotional frames. Meanwhile, the process of attention has demonstrated exceptional performance in learning related feature representations for specific tasks. Inspired by this, propose a Convolutionary Recurrent Neural Networks (ACRNN) based on Attention to learn discriminative features for SER, where the Mel-spectrogram with deltas and delta-deltas is used as input. Finally, experimental results show the feasibility of the proposed method and attain state-of-the-art performance in terms of unweighted average recall.


Sensors ◽  
2021 ◽  
Vol 21 (18) ◽  
pp. 6288
Author(s):  
Sarahi Nicole Castro Pérez ◽  
Stelian Alexandru Borz

Forestry is a complex economic sector which is relying on resource and process monitoring data. Most of the forest operations such as planting and harvesting are supported by the use of tools and machines, and their monitoring has been traditionally done by the use of pen-and-paper time studies. Nevertheless, modern data collection and analysis methods involving different kinds of platforms and machine learning techniques have been studied lately with the aim of easing the data management process. By their outcomes, improvements are still needed to reach a close to 100% activity recognition, which may depend on several factors such as the type of monitored process and the characteristics of the signals used as inputs. In this paper, we test, thought a case study on mechanized pit-drilling operations, the potential of digital signal processing techniques combined with Artificial Neural Networks (ANNs) in improving the event-based classification accuracy in the time domain. Signal processing was implemented by the means of median filtering of triaxial accelerometer data (window sizes of 3, 5, and up to 21 observations collected at 1 Hz) while the ANNs were subjected to the regularization hyperparameter’s tunning. An acceleration signal processed by a median filter with a window size of 3 observations and fed into an ANN set to learn and generalize by a regularization parameter of α = 0.01 has been found to be the best strategy in improving the event-based classification accuracy (improvements of 1% to 8% in classification accuracy depending on the type of event in question). Improvement of classification accuracy by signal filtering and ANN tuning may depend largely on the type of monitored process and its outcomes in terms of event duration; therefore, other monitoring applications may need particular designs of signal processing and ANN tuning.


2020 ◽  
Author(s):  
Zheng Lian ◽  
Jianhua Tao ◽  
Bin Liu ◽  
Jian Huang ◽  
Zhanlei Yang ◽  
...  

Biomolecules ◽  
2021 ◽  
Vol 11 (4) ◽  
pp. 500
Author(s):  
László Keresztes ◽  
Evelin Szögi ◽  
Bálint Varga ◽  
Viktor Farkas ◽  
András Perczel ◽  
...  

The amyloid state of proteins is widely studied with relevance to neurology, biochemistry, and biotechnology. In contrast with nearly amorphous aggregation, the amyloid state has a well-defined structure, consisting of parallel and antiparallel β-sheets in a periodically repeated formation. The understanding of the amyloid state is growing with the development of novel molecular imaging tools, like cryogenic electron microscopy. Sequence-based amyloid predictors were developed, mainly using artificial neural networks (ANNs) as the underlying computational technique. From a good neural-network-based predictor, it is a very difficult task to identify the attributes of the input amino acid sequence, which imply the decision of the network. Here, we present a linear Support Vector Machine (SVM)-based predictor for hexapeptides with correctness higher than 84%, i.e., it is at least as good as the best published ANN-based tools. Unlike artificial neural networks, the decisions of the linear SVMs are much easier to analyze and, from a good predictor, we can infer rich biochemical knowledge. In the Budapest Amyloid Predictor webserver the user needs to input a hexapeptide, and the server outputs a prediction for the input plus the 6 × 19 = 114 distance-1 neighbors of the input hexapeptide.


2021 ◽  
Vol 11 (4) ◽  
pp. 1581
Author(s):  
Jimy Oblitas ◽  
Jezreel Mejia ◽  
Miguel De-la-Torre ◽  
Himer Avila-George ◽  
Lucía Seguí Gil ◽  
...  

Although knowledge of the microstructure of food of vegetal origin helps us to understand the behavior of food materials, the variability in the microstructural elements complicates this analysis. In this regard, the construction of learning models that represent the actual microstructures of the tissue is important to extract relevant information and advance in the comprehension of such behavior. Consequently, the objective of this research is to compare two machine learning techniques—Convolutional Neural Networks (CNN) and Radial Basis Neural Networks (RBNN)—when used to enhance its microstructural analysis. Two main contributions can be highlighted from this research. First, a method is proposed to automatically analyze the microstructural elements of vegetal tissue; and second, a comparison was conducted to select a classifier to discriminate between tissue structures. For the comparison, a database of microstructural elements images was obtained from pumpkin (Cucurbita pepo L.) micrographs. Two classifiers were implemented using CNN and RBNN, and statistical performance metrics were computed using a 5-fold cross-validation scheme. This process was repeated one hundred times with a random selection of images in each repetition. The comparison showed that the classifiers based on CNN produced a better fit, obtaining F1–score average of 89.42% in front of 83.83% for RBNN. In this study, the performance of classifiers based on CNN was significantly higher compared to those based on RBNN in the discrimination of microstructural elements of vegetable foods.


Sign in / Sign up

Export Citation Format

Share Document