Low Latency Convolutive Blind Source Separation

10.26686/wgtn.17136158 ◽

2021 ◽

Author(s):

◽

Jiawen Chua

Keyword(s):

Frequency Domain ◽

Real Time ◽

Impulse Response ◽

Source Separation ◽

Frequency Resolution ◽

Separation Performance ◽

Window Length ◽

Time Frequency ◽

Time Systems ◽

Separation Parameters

<p>In most real-time systems, particularly for applications involving system identification, latency is a critical issue. These applications include, but are not limited to, blind source separation (BSS), beamforming, speech dereverberation, acoustic echo cancellation and channel equalization. The system latency consists of an algorithmic delay and an estimation computational time. The latter can be avoided by using a multi-thread system, which runs the estimation process and the processing procedure simultaneously. The former, which consists of a delay of one window length, is usually unavoidable for the frequency-domain approaches. For frequency-domain approaches, a block of data is acquired by using a window, transformed and processed in the frequency domain, and recovered back to the time domain by using an overlap-add technique. In the frequency domain, the convolutive model, which is usually used to describe the process of a linear time-invariant (LTI) system, can be represented by a series of multiplicative models to facilitate estimation. To implement frequency-domain approaches in real-time applications, the short-time Fourier transform (STFT) is commonly used. The window used in the STFT must be at least twice the room impulse response which is long, so that the multiplicative model is sufficiently accurate. The delay constraint caused by the associated blockwise processing window length makes most the frequency-domain approaches inapplicable for real-time systems. This thesis aims to design a BSS system that can be used in a real-time scenario with minimal latency. Existing BSS approaches can be integrated into our system to perform source separation with low delay without affecting the separation performance. The second goal is to design a BSS system that can perform source separation in a non-stationary environment. We first introduce a subspace approach to directly estimate the separation parameters in the low-frequency-resolution time-frequency (LFRTF) domain. In the LFRTF domain, a shorter window is used to reduce the algorithmic delay of the system during the signal acquisition, e.g., the window length is shorter than the room impulse response. The subspace method facilitates the deconvolution of a convolutive mixture to a new instantaneous mixture and simplifies the estimation process. Second, we propose an alternative approach to address the algorithmic latency problem. The alternative method enables us to obtain the separation parameters in the LFRTF domain based on parameters estimated in the high-frequency-resolution time-frequency (HFRTF) domain, where the window length is longer than the room impulse response, without affecting the separation performance. The thesis also provides a solution to address the BSS problem in a non-stationary environment. We utilize the ``meta-information" that is obtained from previous BSS operations to facilitate the separation in the future without performing the entire BSS process again. Repeating a BSS process can be computationally expensive. Most conventional BSS algorithms require sufficient signal samples to perform analysis and this prolongs the estimation delay. By utilizing information from the entire spectrum, our method enables us to update the separation parameters with only a single snapshot of observation data. Hence, our method minimizes the estimation period, reduces the redundancy and improves the efficacy of the system. The final contribution of the thesis is a non-iterative method for impulse response shortening. This method allows us to use a shorter representation to approximate the long impulse response. It further improves the computational efficiency of the algorithm and yet achieves satisfactory performance.</p>

Download Full-text

Blind Source Separation in the Time-Frequency Domain Based on Multiple Hypothesis Testing

IEEE Transactions on Signal Processing ◽

10.1109/tsp.2007.914316 ◽

2008 ◽

Vol 56 (6) ◽

pp. 2267-2279 ◽

Cited By ~ 10

Author(s):

L. Cirillo ◽

A. Zoubir ◽

M. Amin

Keyword(s):

Hypothesis Testing ◽

Frequency Domain ◽

Blind Source Separation ◽

Source Separation ◽

Multiple Hypothesis Testing ◽

Time Frequency ◽

Multiple Hypothesis

Download Full-text

Blind source separation of acoustic mixtures using time-frequency domain independent component analysis

The 8th International Conference on Communication Systems, 2002. ICCS 2002. ◽

10.1109/iccs.2002.1183286 ◽

2003 ◽

Cited By ~ 1

Author(s):

D.S. Jayarman ◽

G. Sitaraman ◽

R. Seshadri

Keyword(s):

Independent Component Analysis ◽

Frequency Domain ◽

Blind Source Separation ◽

Source Separation ◽

Component Analysis ◽

Independent Component ◽

Time Frequency ◽

Domain Independent

Download Full-text

Real-time frequency-domain terahertz sensing and imaging of isopropyl alcohol–water mixtures on a microfluidic chip

Sensors and Actuators B Chemical ◽

10.1016/j.snb.2013.04.008 ◽

2013 ◽

Vol 184 ◽

pp. 228-234 ◽

Cited By ~ 24

Author(s):

Lei Liu ◽

Rahul Pathak ◽

Li-Jing Cheng ◽

Tao Wang

Keyword(s):

Frequency Domain ◽

Real Time ◽

Microfluidic Chip ◽

Isopropyl Alcohol ◽

Time Frequency ◽

Alcohol Water ◽

Terahertz Sensing

Download Full-text

Underdetermined source separation of EEG signals in the time-frequency domain

2008 IEEE International Conference on Acoustics, Speech and Signal Processing ◽

10.1109/icassp.2008.4518440 ◽

2008 ◽

Cited By ~ 3

Author(s):

Zeyong Shan ◽

Jacob Swary ◽

Selin Aviyente

Keyword(s):

Frequency Domain ◽

Source Separation ◽

Eeg Signals ◽

Time Frequency

Download Full-text

Unsupervised Learning for Monaural Source Separation Using Maximization–Minimization Algorithm with Time–Frequency Deconvolution

Sensors ◽

10.3390/s18051371 ◽

2018 ◽

Vol 18 (5) ◽

pp. 1371 ◽

Cited By ~ 5

Author(s):

Wai Lok Woo ◽

Bin Gao ◽

Ahmed Bouridane ◽

Bingo Wing-Kuen Ling ◽

Cheng Siong Chin

Keyword(s):

Unsupervised Learning ◽

Single Channel ◽

Learning Algorithm ◽

Source Separation ◽

Nonnegative Matrix ◽

Least Square ◽

Separation Performance ◽

Time Frequency ◽

Special Cases ◽

Leibler Divergence

This paper presents an unsupervised learning algorithm for sparse nonnegative matrix factor time–frequency deconvolution with optimized fractional β-divergence. The β-divergence is a group of cost functions parametrized by a single parameter β. The Itakura–Saito divergence, Kullback–Leibler divergence and Least Square distance are special cases that correspond to β=0, 1, 2, respectively. This paper presents a generalized algorithm that uses a flexible range of β that includes fractional values. It describes a maximization–minimization (MM) algorithm leading to the development of a fast convergence multiplicative update algorithm with guaranteed convergence. The proposed model operates in the time–frequency domain and decomposes an information-bearing matrix into two-dimensional deconvolution of factor matrices that represent the spectral dictionary and temporal codes. The deconvolution process has been optimized to yield sparse temporal codes through maximizing the likelihood of the observations. The paper also presents a method to estimate the fractional β value. The method is demonstrated on separating audio mixtures recorded from a single channel. The paper shows that the extraction of the spectral dictionary and temporal codes is significantly more efficient by using the proposed algorithm and subsequently leads to better source separation performance. Experimental tests and comparisons with other factorization methods have been conducted to verify its efficacy.

Download Full-text

Underdetermined convolutive blind source separation in the time–frequency domain based on single source points and experimental validation

Measurement Science and Technology ◽

10.1088/1361-6501/ab816f ◽

2020 ◽

Vol 31 (9) ◽

pp. 095001

Author(s):

Wei Cheng ◽

Zhengzheng Jia ◽

Xuefeng Chen ◽

Linsheng Han ◽

Guanghui Zhou ◽

...

Keyword(s):

Frequency Domain ◽

Blind Source Separation ◽

Experimental Validation ◽

Source Separation ◽

Single Source ◽

Time Frequency ◽

Single Source Points ◽

Convolutive Blind Source Separation

Download Full-text

An adaptive time-frequency resolution approach for Non-negative Matrix Factorization based single channel sound source separation

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2011.5946388 ◽

2011 ◽

Cited By ~ 7

Author(s):

Serap Kirbiz ◽

Paris Smaragdis

Keyword(s):

Sound Source ◽

Matrix Factorization ◽

Single Channel ◽

Source Separation ◽

Frequency Resolution ◽

Time Frequency ◽

Sound Source Separation ◽

Adaptive Time ◽

Non Negative Matrix Factorization

Download Full-text

An adaptive time-frequency resolution framework for single channel source separation based on non-negative tensor factorization

2013 IEEE International Conference on Acoustics, Speech and Signal Processing ◽

10.1109/icassp.2013.6637780 ◽

2013 ◽

Cited By ~ 1

Author(s):

S. Kirbiz ◽

B. Gunsel

Keyword(s):

Single Channel ◽

Source Separation ◽

Frequency Resolution ◽

Tensor Factorization ◽

Time Frequency ◽

Adaptive Time

Download Full-text

The Relationship between Frequency Domain Blind Source Separation and Frequency Domain Adaptive Beamformer

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.490-491.654 ◽

2014 ◽

Vol 490-491 ◽

pp. 654-662

Author(s):

Si Chong Qian ◽

Yang Xiang

Keyword(s):

Frequency Domain ◽

Mean Square Error ◽

Blind Source Separation ◽

Array Signal Processing ◽

Source Separation ◽

Adaptive Beamforming ◽

Separation Performance ◽

Mean Square ◽

The Mean ◽

The Relationship

As two important methods of array signal processing, blind source separation and beamforming can extract the target signal and suppress interference by using the received information of the array element. In the case of convolution mixture of sources, frequency domain blind source separation and frequency domain adaptive beamforming have similar signal model. To find the relationship between them, comparison between the minimization of the off-diagonal components in the BSS update equation and the minimization of the mean square error in the ABF had been made from the perspective of mathematical expressions, and find that the unmixing matrix of the BSS and the filter coefficients of the ABF converge to the same solution in the mean square error sense under the condition that the two source signals are ideally independent. With MATLAB, the equivalence in the frequency domain have been verified and the causes affecting separation performance have been analyzed, which was achieved by simulating instantaneous and convolution mixtures and separating mixture speech in frequency-domain blind source separation and frequency domain adaptive beamforming way.

Download Full-text