A binaural sound source localization model based on time-delay compensation and interaural coherence

Author(s):  
Hong Liu ◽  
Jie Zhang
2021 ◽  
Vol 263 (4) ◽  
pp. 2279-2283
Author(s):  
Soo Young Lee ◽  
Jiho Chang ◽  
Seungchul Lee

In this contribution, we present a high-resolution, accurate sound source localization method built on a deep learning framework. Although spherical microphone arrays can produce omnidirectional beams, conventional spherical harmonics beamforming (SHB) is widely known to be limited in spatial resolution. To localize sound sources with high resolution and precision, we propose a data-driven source localization model based on a convolutional neural network (CNN). We first present a novel definition of a source distribution map that spatially represents the position and strength of a single point source. Using a paired dataset of spherical harmonics beamforming maps and our proposed high-resolution maps, we develop a fully convolutional neural network with an encoder-decoder structure that serves as an image-to-image transformation model. Both quantitative and qualitative results are presented to evaluate the effectiveness of the proposed data-driven source localization model.


2019 ◽  
Vol 9 (12) ◽  
pp. 2417
Author(s):  
Hongyan Xing ◽  
Xu Yang

To reduce the negative effect on sound source localization when the source is at an extreme angle, and to improve localization precision and stability, a theoretical model of a three-plane five-element microphone array is established, using time-delay values to judge the sound source’s quadrant position. Corresponding judgment criteria are proposed, solving the problem in which a single-plane array easily blurs the measured position. Based on geometric sound source localization, a formula for calculating the sound source azimuth with a single-plane five-element microphone array is derived. The sines and cosines of two elevation angles obtained from two single-plane arrays are introduced into the sound source spherical coordinates as composite weighting coefficients, and a sound source localization fusion algorithm based on a three-plane five-element microphone array is proposed. The relationship between the time-delay estimation error, elevation angle, horizontal angle, and microphone array localization performance is discussed, and the precision and stability of ranging and direction finding are analyzed. The results show that the measurement precision of the distance from the sound source to the array center and of the horizontal angle is improved one to threefold, and the measurement precision of the elevation angle is improved one to twofold. Although there is a small error, the overall performance of the sound source localization is stable, reflecting the advantages of the fusion algorithm.
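As a rough illustration of how time delays can indicate a source's quadrant and direction, the sketch below uses the standard far-field (plane-wave) relation for a single-plane cross-shaped array. It is not the formula derived in the paper. The geometry (two microphone pairs at a distance `half_spacing` from the center along the x and y axes), the function names, and the speed of sound are all assumptions made for this example.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value for air at room temperature


def simulate_delays(az_deg, el_deg, half_spacing, c=SPEED_OF_SOUND):
    """Return (tau_x, tau_y) for a far-field source (hypothetical helper).

    tau_x is the arrival time at the mic at (-d, 0, 0) minus the arrival
    time at the mic at (+d, 0, 0); tau_y is the same for the y-axis pair.
    """
    az, el = math.radians(az_deg), math.radians(el_deg)
    ux = math.cos(el) * math.cos(az)  # x component of unit vector to source
    uy = math.cos(el) * math.sin(az)  # y component of unit vector to source
    return 2 * half_spacing * ux / c, 2 * half_spacing * uy / c


def far_field_direction(tau_x, tau_y, half_spacing, c=SPEED_OF_SOUND):
    """Estimate (azimuth, elevation) in degrees from the two pair delays.

    Azimuth is returned in (-180, 180]. The signs of tau_x and tau_y fix
    the quadrant, which atan2 resolves directly. Elevation is returned in
    [0, 90]: a single-plane array cannot tell whether the source is above
    or below the plane, so "above" is assumed here.
    """
    ux = c * tau_x / (2 * half_spacing)
    uy = c * tau_y / (2 * half_spacing)
    azimuth = math.atan2(uy, ux)
    # Clamp so delay noise cannot push the argument of acos above 1.
    horizontal = min(1.0, math.hypot(ux, uy))
    elevation = math.acos(horizontal)
    return math.degrees(azimuth), math.degrees(elevation)


if __name__ == "__main__":
    d = 0.1  # half spacing in meters (illustrative)
    tau_x, tau_y = simulate_delays(130.0, 40.0, d)
    print(far_field_direction(tau_x, tau_y, d))
```

The above/below ambiguity noted in the docstring is one way a single-plane array can blur the measured position, and combining arrays in additional planes is one way to remove it.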

