scholarly journals Frequency-Domain Blind Source Separation of Many Speech Signals Using Near-Field and Far-Field Models

Author(s):  
Ryo Mukai ◽  
Hiroshi Sawada ◽  
Shoko Araki ◽  
Shoji Makino
2012 ◽  
Vol 433-440 ◽  
pp. 7029-7034
Author(s):  
De Xiang Zhang ◽  
Xiao Pei Wu ◽  
Zhao Lv ◽  
Xiao Jing Guo

The signals of convolutive mixture in time-domain can be transformed to instantaneous mixtures in frequency-domain and complex-valued independent component analysis (CICA) can separate efficiently the signals of instantaneous mixture in each frequency bin. However, since CICA is calculated in each frequency bin independently, the permutation ambiguity becomes a serious problem. The permutation ambiguity of CICA in each frequency bin should be aligned so that a separated signal in the time-domain contains frequency components of the same source signal. The paper presents a novel and efficient approach for solving the permutation problem in frequency domain blind source separation of speech signals. The new algorithm models the frequency-domain separated signals by means of Teager energy correlation between neighboring bins for the detection of correct permutations. Experimental results show that the proposed algorithm can efficiently solve the permutation ambiguity problem in each frequency bin.


Algorithms ◽  
2021 ◽  
Vol 14 (6) ◽  
pp. 165
Author(s):  
Shiyu Guo ◽  
Mengna Shi ◽  
Yanqi Zhou ◽  
Jiayin Yu ◽  
Erfu Wang

As the main method of information transmission, it is particularly important to ensure the security of speech communication. Considering the more complex multipath channel transmission situation in the wireless communication of speech signals and separating or extracting the source signal from the convolutional signal are crucial steps in obtaining source information. In this paper, chaotic masking technology is used to guarantee the transmission safety of speech signals, and a fast fixed-point independent vector analysis algorithm is used to solve the problem of convolutional blind source separation. First, the chaotic masking is performed before the speech signal is sent, and the convolutional mixing process of multiple signals is simulated by impulse response filter. Then, the observed signal is transformed to the frequency domain by short-time Fourier transform, and instantaneous blind source separation is performed using a fast fixed-point independent vector analysis algorithm. The algorithm can preserve the high-order statistical correlation between frequencies to solve the permutation ambiguity problem in independent component analysis. Simulation experiments show that this algorithm can efficiently complete the blind extraction of convolutional signals, and the quality of recovered speech signals is better. It provides a solution for the secure transmission and effective separation of speech signals in multipath transmission channels.


Sign in / Sign up

Export Citation Format

Share Document