Transformation of short-term spectral envelope of speech signal using multivariate polynomial modeling

Author(s):  
P. K. Lehana ◽  
P. C. Pandey
2021 ◽  
Vol 12 ◽  
Author(s):  
Xiang Chen ◽  
Rubing Huang ◽  
Xin Li ◽  
Lei Xiao ◽  
Ming Zhou ◽  
...  

Emotional design is an important development trend of interaction design. Emotional design in products plays a key role in enhancing user experience and inducing user emotional resonance. In recent years, based on the user's emotional experience, the design concept of strengthening product emotional design has become a new direction for most designers to improve their design thinking. In the emotional interaction design, the machine needs to capture the user's key information in real time, recognize the user's emotional state, and use a variety of clues to finally determine the appropriate user model. Based on this background, this research uses a deep learning mechanism for more accurate and effective emotion recognition, thereby optimizing the design of the interactive system and improving the user experience. First of all, this research discusses how to use user characteristics such as speech, facial expression, video, heartbeat, etc., to make machines more accurately recognize human emotions. Through the analysis of various characteristics, the speech is selected as the experimental material. Second, a speech-based emotion recognition method is proposed. The mel-Frequency cepstral coefficient (MFCC) of the speech signal is used as the input of the improved long and short-term memory network (ILSTM). To ensure the integrity of the information and the accuracy of the output at the next moment, ILSTM makes peephole connections in the forget gate and input gate of LSTM, and adds the unit state as input data to the threshold layer. The emotional features obtained by ILSTM are input into the attention layer, and the self-attention mechanism is used to calculate the weight of each frame of speech signal. The speech features with higher weights are used to distinguish different emotions and complete the emotion recognition of the speech signal. Experiments on the EMO-DB and CASIA datasets verify the effectiveness of the model for emotion recognition. Finally, the feasibility of emotional interaction system design is discussed.


2017 ◽  
Vol 14 (27) ◽  
pp. 97-104
Author(s):  
Itamar Magno BARBOSA ◽  
Bogos Nubar SISMANOGLU ◽  
Pedro Ivo Pinto OLIVEIRA

We analyzed a multivariate polynomial model related to Calibration Curve of an External Balance of Aerodynamic Forces. The ISO 17025 explicit that it is not always possible to calculate rigorously the measurement uncertainty in Test Laboratories. The test nature circumscribes error sources control, time and costs. This fact implies that choosing a Mathematical Modeling is also an affair of productivity management, not only a computational matter. For a First Degree Polynomial Modeling we evaluated the Statistical Performance Index, as to Measurement Bias, Standard Uncertainty, X – Square and Uncertainties and Co – Variances Matrix. We evaluated the Measurement System Repeatability through successive resembling Calibrations. We showed Aerodynamic Forces that may be considered individual ones trough uncertainties and systematic errors. In the same way, bias was verified through systematic error analysis, i. e., in what conditions the polynomial intersects at the origin or not.


2014 ◽  
Vol 513-517 ◽  
pp. 2906-2909
Author(s):  
Wei Ping Cui ◽  
Xiu Yan Wang

Frequency domain analysis of speech signal includes spectrum analysis, power spectrum and the inverse spectrum, spectral envelope analysis. The common methods of frequency domain analysis includes bandpass filters , Fourier transform, linear see method etc.. Using MATLAB discusses the short section of the speech signal spectrum, cepstrum and complex spectrum, the pitch and formant simulation results are given.


Sign in / Sign up

Export Citation Format

Share Document