scholarly journals Single channel speech separation in modulation frequency domain based on a novel pitch range estimation method

Author(s):  
Azar Mahmoodzadeh ◽  
Hamid Reza Abutalebi ◽  
Hamid Soltanian-Zadeh ◽  
Hamid Sheikhzadeh
2020 ◽  
Vol 30 (01) ◽  
pp. 2050003
Author(s):  
Wenjie Peng ◽  
Kaiqi Fu ◽  
Wei Zhang ◽  
Yanlu Xie ◽  
Jinsong Zhang

Pitch-range estimation from brief speech segments could bring benefits to many tasks like automatic speech recognition and speaker recognition. To estimate pitch range, previous studies have proposed to utilize deep-learning-based models with spectrum information as input. They demonstrated that such method works and could still achieve reliable estimation results when the speech segment is as brief as 300 ms. In this study, we evaluated the robustness of this method. We take the following scenarios into account: (1) a large number of training speakers; (2) different language backgrounds; and (3) monosyllabic utterances with different tones. Experimental results showed that: (1) The use of a large number of training speakers improved the estimation accuracies. (2) The mean absolute percentage error (MAPE) rate evaluated on the L2 speakers is similar to that on the native speakers. (3) Different tonal information will affect the LSTM-based model, but this influence is limited compared to the baseline method which calculates pitch-range targets from the distribution of [Formula: see text]0 values. These experimental results verified the efficiency of the LSTM-based pitch-range estimation method.


Author(s):  
Qi Zhang ◽  
Chong Cao ◽  
Tiantian Li ◽  
Yanlu Xie ◽  
Jinsong Zhang
Keyword(s):  

2018 ◽  
Vol 2018 ◽  
pp. 1-9 ◽  
Author(s):  
Haiwen Li ◽  
Nae Zheng ◽  
Xiyu Song ◽  
Yinghua Tian

The estimation speed of positioning parameters determines the effectiveness of the positioning system. The time of arrival (TOA) and direction of arrival (DOA) parameters can be estimated by the space-time two-dimensional multiple signal classification (2D-MUSIC) algorithm for array antenna. However, this algorithm needs much time to complete the two-dimensional pseudo spectral peak search, which makes it difficult to apply in practice. Aiming at solving this problem, a fast estimation method of space-time two-dimensional positioning parameters based on Hadamard product is proposed in orthogonal frequency division multiplexing (OFDM) system, and the Cramer-Rao bound (CRB) is also presented. Firstly, according to the channel frequency domain response vector of each array, the channel frequency domain estimation vector is constructed using the Hadamard product form containing location information. Then, the autocorrelation matrix of the channel response vector for the extended array element in frequency domain and the noise subspace are calculated successively. Finally, by combining the closed-form solution and parameter pairing, the fast joint estimation for time delay and arrival direction is accomplished. The theoretical analysis and simulation results show that the proposed algorithm can significantly reduce the computational complexity and guarantee that the estimation accuracy is not only better than estimating signal parameters via rotational invariance techniques (ESPRIT) algorithm and 2D matrix pencil (MP) algorithm but also close to 2D-MUSIC algorithm. Moreover, the proposed algorithm also has certain adaptability to multipath environment and effectively improves the ability of fast acquisition of location parameters.


Sign in / Sign up

Export Citation Format

Share Document