The effect of speech denoising algorithms on sound source localization for humanoid robots

[abstFig src='/00290001/03.jpg' width='300' text='Sound source localization and problem' ] We focus on the problem of localizing soft/weak voices recorded by small humanoid robots, such as NAO. Sound source localization (SSL) for such robots requires fast processing and noise robustness owing to the restricted resources and the internal noise close to the microphones. Multiple signal classification using generalized eigenvalue decomposition (GEVD-MUSIC) is a promising method for SSL. It achieves noise robustness by whitening robot internal noise using prior noise information. However, whitening increases the computational cost and creates a direction-dependent bias in the localization score, which degrades the localization accuracy. We have thus developed a new implementation of GEVD-MUSIC based on steering vector transformation (TSV-MUSIC). The application of a transformation equivalent to whitening to steering vectors in advance reduces the real-time computational cost of TSV-MUSIC. Moreover, normalization of the transformed vectors cancels the direction-dependent bias and improves the localization accuracy. Experiments using simulated data showed that TSV-MUSIC had the highest accuracy of the methods tested. An experiment using real recoded data showed that TSV-MUSIC outperformed GEVD-MUSIC and other MUSIC methods in terms of localization by about 4 points under low signal-to-noise-ratio conditions.

Download Full-text

Improved Sound Source Localization and Front-Back Disambiguation for Humanoid Robots with Two Ears

Recent Trends in Applied Artificial Intelligence - Lecture Notes in Computer Science ◽

10.1007/978-3-642-38577-3_29 ◽

2013 ◽

pp. 282-291 ◽

Cited By ~ 2

Author(s):

Ui-Hyun Kim ◽

Kazuhiro Nakadai ◽

Hiroshi G. Okuno

Keyword(s):

Source Localization ◽

Sound Source ◽

Humanoid Robots ◽

Sound Source Localization

Download Full-text

Comparison of Convolution Types in CNN-based Feature Extraction for Sound Source Localization

2020 28th European Signal Processing Conference (EUSIPCO) ◽

10.23919/eusipco47968.2020.9287344 ◽

2021 ◽

Author(s):

Daniel Krause ◽

Archontis Politis ◽

Konrad Kowalczyk

Keyword(s):

Feature Extraction ◽

Source Localization ◽

Sound Source ◽

Sound Source Localization

Download Full-text

Sound source localization based on audio-video system of mobile robot

Journal of Computer Applications ◽

10.3724/sp.j.1087.2009.02471 ◽

2009 ◽

Vol 29 (9) ◽

pp. 2471-2472

Author(s):

Tao CHEN ◽

Ming-lu ZHANG ◽

Ling-li FU

Keyword(s):

Mobile Robot ◽

Source Localization ◽

Sound Source ◽

Sound Source Localization ◽

Video System ◽

Audio Video

Download Full-text

Self-supervised Neural Audio-Visual Sound Source Localization via Probabilistic Spatial Modeling

2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) ◽

10.1109/iros45743.2020.9340938 ◽

2020 ◽

Author(s):

Yoshiki Masuyama ◽

Yoshiaki Bando ◽

Kohei Yatabe ◽

Yoko Sasaki ◽

Masaki Onishi ◽

...

Keyword(s):

Source Localization ◽

Sound Source ◽

Spatial Modeling ◽

Sound Source Localization

Download Full-text

Multi-Sound Source Localization in Time Domain Using Voting Mechanism

2020 IEEE 9th Global Conference on Consumer Electronics (GCCE) ◽

10.1109/gcce50665.2020.9291794 ◽

2020 ◽

Author(s):

Shih-Tsung Yang ◽

Kai-Wen Liang ◽

Pao-Chi Chang

Keyword(s):

Source Localization ◽

Sound Source ◽

Time Domain ◽

Sound Source Localization ◽

Voting Mechanism

Download Full-text

Multi-Tone Phase Coding of Interaural Time Difference for Sound Source Localization with Spiking Neural Networks

IEEE/ACM Transactions on Audio Speech and Language Processing ◽

10.1109/taslp.2021.3100684 ◽

2021 ◽

pp. 1-1

Author(s):

Zihan Pan ◽

Malu Zhang ◽

Jibin Wu ◽

Jiadong Wang ◽

Haizhou Li

Keyword(s):

Neural Networks ◽

Source Localization ◽

Sound Source ◽

Interaural Time Difference ◽

Time Difference ◽

Spiking Neural Networks ◽

Sound Source Localization ◽

Phase Coding

Download Full-text

Towards Robust Multiple Blind Source Localization Using Source Separation and Beamforming

Sensors ◽

10.3390/s21020532 ◽

2021 ◽

Vol 21 (2) ◽

pp. 532

Author(s):

Henglin Pu ◽

Chao Cai ◽

Menglan Hu ◽

Tianping Deng ◽

Rong Zheng ◽

...

Keyword(s):

Source Localization ◽

Sound Source ◽

Indoor Localization ◽

Weighting Function ◽

Signal To Noise Ratio ◽

Source Separation ◽

Sound Source Localization ◽

Angle Of Arrival ◽

Sound Sources ◽

Localization Algorithms

Multiple blind sound source localization is the key technology for a myriad of applications such as robotic navigation and indoor localization. However, existing solutions can only locate a few sound sources simultaneously due to the limitation imposed by the number of microphones in an array. To this end, this paper proposes a novel multiple blind sound source localization algorithms using Source seParation and BeamForming (SPBF). Our algorithm overcomes the limitations of existing solutions and can locate more blind sources than the number of microphones in an array. Specifically, we propose a novel microphone layout, enabling salient multiple source separation while still preserving their arrival time information. After then, we perform source localization via beamforming using each demixed source. Such a design allows minimizing mutual interference from different sound sources, thereby enabling finer AoA estimation. To further enhance localization performance, we design a new spectral weighting function that can enhance the signal-to-noise-ratio, allowing a relatively narrow beam and thus finer angle of arrival estimation. Simulation experiments under typical indoor situations demonstrate a maximum of only 4∘ even under up to 14 sources.

Download Full-text