Multi-speaker DOA estimation in reverberation conditions using expectation-maximization

AbstractThe problem of blind and online speaker localization and separation using multiple microphones is addressed based on the recursive expectation-maximization (REM) procedure. A two-stage REM-based algorithm is proposed: (1) multi-speaker direction of arrival (DOA) estimation and (2) multi-speaker relative transfer function (RTF) estimation. The DOA estimation task uses only the time frequency (TF) bins dominated by a single speaker while the entire frequency range is not required to accomplish this task. In contrast, the RTF estimation task requires the entire frequency range in order to estimate the RTF for each frequency bin. Accordingly, a different statistical model is used for the two tasks. The first REM model is applied under the assumption that the speech signal is sparse in the TF domain, and utilizes a mixture of Gaussians (MoG) model to identify the TF bins associated with a single dominant speaker. The corresponding DOAs are estimated using these bins. The second REM model is applied under the assumption that the speakers are concurrently active in all TF bins and consequently applies a multichannel Wiener filter (MCWF) to separate the speakers. As a result of the assumption of the concurrent speakers, a more precise TF map of the speakers’ activity is obtained. The RTFs are estimated using the outputs of the MCWF-beamformer (BF), which are constructed using the DOAs obtained in the previous stage. Next, using the linearly constrained minimum variance (LCMV)-BF that utilizes the estimated RTFs, the speech signals are separated. The algorithm is evaluated using real-life scenarios of two speakers. Evaluation of the mean absolute error (MAE) of the estimated DOAs and the separation capabilities, demonstrates significant improvement w.r.t. a baseline DOA estimation and speaker separation algorithm.

Download Full-text

Evaluating expectation-maximization algorithm for 2D DOA estimation via planar antenna arrays

Proceedings of the International Conference on High Performance Compilation, Computing and Communications - HP3C-2017 ◽

10.1145/3069593.3069595 ◽

2017 ◽

Author(s):

Y. B. Nechaev ◽

K. D. Alkhafaji Sarmad ◽

I. W. Peshkov

Keyword(s):

Antenna Arrays ◽

Expectation Maximization ◽

Expectation Maximization Algorithm ◽

Doa Estimation ◽

Planar Antenna ◽

2D Doa Estimation

Download Full-text

1-Bit DOA Estimation using Expectation-maximization Generalized Approximate Message Passing with two L-shaped arrays

IEEE Communications Letters ◽

10.1109/lcomm.2021.3079307 ◽

2021 ◽

pp. 1-1

Author(s):

Chenyu Li ◽

Qing Wang ◽

Hua Chen ◽

Liping Teng

Keyword(s):

Expectation Maximization ◽

Message Passing ◽

Doa Estimation ◽

Approximate Message Passing

Download Full-text

Mixture Modeling based Multikernel Sparse Learning for Directional of Arrival Estimation

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.f1132.0886s19 ◽

2019 ◽

Vol 8 (6S) ◽

pp. 670-675

Keyword(s):

Expectation Maximization ◽

Expectation Maximization Algorithm ◽

Doa Estimation ◽

Mixture Modeling ◽

Sparse Signal ◽

Sparse Signal Recovery ◽

Sparse Learning ◽

Signal Recovery ◽

Uniform Array ◽

Virtual Array

Direction of Arrival (DOA) estimation problem is defined as the problem of Sparse Signal Recovery (SSR) in researches published on the Uniform or Non Uniform array based implementations. This Paper attempts a Multikernel Sparse learning (MSL) approach with mixture modeling for the SSR problem to improve the performance parameters including the PSNR and the RMSE of the estimated sparse signal in the underdetermined condition. The Expectation Maximization algorithm is exploited to obtain the convergence in the mixture modeling MSL method. The virtual array response problem thus developed uses the mixture modeling MSL to estimate the DOA. Matlab based implementation is carried out and the results are found to be satisfactory.

Download Full-text