sound source localization Latest Research Papers

Active Binaural Auditory Perceptual System for a Socially Interactive Humanoid Robot

Engineering Proceedings ◽

10.3390/engproc2021012083 ◽

2022 ◽

Vol 12 (1) ◽

pp. 83

Author(s):

Sohaib Siddique Butt ◽

Mahnoor Fatima ◽

Ali Asghar ◽

Wasif Muhammad

Keyword(s):

Neural Network ◽

Source Localization ◽

Sound Source ◽

Humanoid Robot ◽

Cross Correlation ◽

Perceptual System ◽

Sound Source Localization ◽

Signal Processing Technique ◽

Gaze Shift ◽

Perception System

Sound Source Localization (SSL) and gaze shift to the sound source behavior is an integral part of a socially interactive humanoid robot perception system. In noisy and reverberant environments, it is non-trivial to estimate the location of a sound source and accurately shift gaze in its direction. Previous SSL algorithms are deficient in the optimum approximation of distance to audio sources and to accurately detect, interpret, and differentiate the actual sound from comparable sound sources due to challenging acoustic environments. In this article, a learning-based model is presented to achieve noiseless and reverberation-resistant sound source localization in the real-world scenarios. The proposed system utilizes a multi-layered Gaussian Cross-Correlation with Phase Transform (GCC-PHAT) signal processing technique as a baseline for a Generalized Cross Correlation Convolution Neural Network (GCC-CNN) model. The proposed model is integrated with an efficient rotation algorithm to predict and orient toward the sound source. The performance of the proposed method is compared with the state-of-art deep network-based sound source localization methods. The findings of the proposed method outperform the existing neural network-based approaches by achieving the highest accuracy of 96.21% for an active binaural auditory perceptual system.

Download Full-text

Drone audition: Sound source localization using on-board microphones

IEEE/ACM Transactions on Audio Speech and Language Processing ◽

10.1109/taslp.2022.3140550 ◽

2022 ◽

pp. 1-1

Author(s):

Wageesha Nilmini Manamperi ◽

Thushara Dheemantha Abhayapala ◽

Jihui Amiee Zhang ◽

Prasanga Samarasinghe

Keyword(s):

Source Localization ◽

Sound Source ◽

Sound Source Localization

Download Full-text

An Improved Multiple Sound Source Localization Method Using a Uniform Concentric Circular Microphone Array

10.1109/icicip53388.2021.9642218 ◽

2021 ◽

Author(s):

Yuting Zhang ◽

Hongwei Zhang ◽

Honghai Liu

Keyword(s):

Source Localization ◽

Sound Source ◽

Microphone Array ◽

Sound Source Localization ◽

Localization Method

Download Full-text

Discrete-Time ZND Algorithms for Time-Dependent LQ Decomposition Applied to Sound Source Localization

10.1109/icicip53388.2021.9642202 ◽

2021 ◽

Author(s):

Jinjin Guo ◽

Yunong Zhang

Keyword(s):

Discrete Time ◽

Source Localization ◽

Sound Source ◽

Time Dependent ◽

Sound Source Localization

Download Full-text

Sound Source Localization Using a Convolutional Neural Network and Regression Model

Sensors ◽

10.3390/s21238031 ◽

2021 ◽

Vol 21 (23) ◽

pp. 8031

Author(s):

Tan-Hsu Tan ◽

Yu-Tang Lin ◽

Yang-Lang Chang ◽

Mohammad Alkhaleefah

Keyword(s):

Neural Network ◽

Regression Model ◽

Convolutional Neural Network ◽

Impulse Response ◽

Source Localization ◽

Sound Source ◽

Real Life ◽

Sound Source Localization ◽

Time Frequency ◽

Simulation Scenario

In this research, a novel sound source localization model is introduced that integrates a convolutional neural network with a regression model (CNN-R) to estimate the sound source angle and distance based on the acoustic characteristics of the interaural phase difference (IPD). The IPD features of the sound signal are firstly extracted from time-frequency domain by short-time Fourier transform (STFT). Then, the IPD features map is fed to the CNN-R model as an image for sound source localization. The Pyroomacoustics platform and the multichannel impulse response database (MIRD) are used to generate both simulated and real room impulse response (RIR) datasets. The experimental results show that an average accuracy of 98.96% and 98.31% are achieved by the proposed CNN-R for angle and distance estimations in the simulation scenario at SNR = 30 dB and RT60 = 0.16 s, respectively. Moreover, in the real environment, the average accuracies of the angle and distance estimations are 99.85% and 99.38% at SNR = 30 dB and RT60 = 0.16 s, respectively. The performance obtained in both scenarios is superior to that of existing models, indicating the potential of the proposed CNN-R model for real-life applications.

Download Full-text

Deep learning-based method for multiple sound source localization with high resolution and accuracy

Mechanical Systems and Signal Processing ◽

10.1016/j.ymssp.2021.107959 ◽

2021 ◽

Vol 161 ◽

pp. 107959

Author(s):

Soo Young Lee ◽

Jiho Chang ◽

Seungchul Lee

Keyword(s):

Deep Learning ◽

High Resolution ◽

Source Localization ◽

Sound Source ◽

Sound Source Localization

Download Full-text

Effective DOA Estimation Method for Sound Source Localization Using a Circular Microphone Array

10.1007/978-3-030-92038-8_50 ◽

2021 ◽

pp. 497-505

Author(s):

Douaer Belgacem

Keyword(s):

Source Localization ◽

Sound Source ◽

Microphone Array ◽

Estimation Method ◽

Doa Estimation ◽

Sound Source Localization

Download Full-text

Efficient Energy-based Orthogonal Matching Pursuit Algorithm for Multiple Sound Source Localization with Unknown Source Count

Measurement Science and Technology ◽

10.1088/1361-6501/ac3d46 ◽

2021 ◽

Author(s):

Rongjiang Tang ◽

Yingxiang zuo ◽

Weiya Liu ◽

Liguo Tang ◽

Weiguang Zheng ◽

...

Keyword(s):

Compressed Sensing ◽

Source Localization ◽

Sound Source ◽

Matching Pursuit ◽

Orthogonal Matching Pursuit ◽

Sound Source Localization ◽

Signal Frequency ◽

Localization Algorithm ◽

Sound Sources ◽

Signal Energy

Abstract In this paper, we propose a compressed sensing (CS) sound source localization algorithm based on signal energy to solve the problem of stopping iteration condition of orthogonal matching pursuit reconstruction algorithm in compressed sensing. The orthogonal matching tracking algorithm needs to stop iteration according to the number of sound sources or the change of residual. Generally, the number of sound sources cannot be known in advance, and the residual often leads to unnecessary calculation. Because the sound source is sparsely distributed in space, and its energy is concentrated and higher than that of the environmental noise, the comparison of the signal energy at different positions in each iteration reconstruction signal is used to determine whether the new sound source is added in this iteration. At the same time, the block sparsity is introduced by using multiple frequency points to avoid the problem of different iteration times of different frequency points in the same frame caused by the uneven energy distribution in the signal frequency domain. Simulation and experimental results show that the proposed algorithm retains the advantages of the orthogonal matching tracking sound source localization algorithm, and can complete the iteration well. Under the premise of not knowing the number of sound sources, the maximum error between the number of iterations and the set number of sound sources is 0.31.

Download Full-text

Temporal Characteristics of Azimuthally Moving Sound Source Localization in Patients with Mild and Moderate Sensorineural Hearing Loss

Journal of Evolutionary Biochemistry and Physiology ◽

10.1134/s0022093021060260 ◽

2021 ◽

Vol 57 (6) ◽

pp. 1499-1510

Author(s):

E. A. Klishova ◽

A. P. Gvozdeva ◽

L. E. Golovanova ◽

I. G. Andreeva

Keyword(s):

Hearing Loss ◽

Sensorineural Hearing Loss ◽

Source Localization ◽

Sound Source ◽

Sensorineural Hearing ◽

Sound Source Localization ◽

Temporal Characteristics ◽

Moving Sound Source

Download Full-text

Fundamental study on sound source localization inside a structure using a deep neural network and computer-aided engineering

Journal of Sound and Vibration ◽

10.1016/j.jsv.2021.116400 ◽

2021 ◽

Vol 513 ◽

pp. 116400

Author(s):

Shunsuke Kita ◽

Yoshinobu Kajikawa

Keyword(s):

Neural Network ◽

Source Localization ◽

Sound Source ◽

Deep Neural Network ◽

Computer Aided Engineering ◽

Sound Source Localization ◽

Fundamental Study ◽

Computer Aided

Download Full-text

sound source localization
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Active Binaural Auditory Perceptual System for a Socially Interactive Humanoid Robot

Drone audition: Sound source localization using on-board microphones

An Improved Multiple Sound Source Localization Method Using a Uniform Concentric Circular Microphone Array

Discrete-Time ZND Algorithms for Time-Dependent LQ Decomposition Applied to Sound Source Localization

Sound Source Localization Using a Convolutional Neural Network and Regression Model

Deep learning-based method for multiple sound source localization with high resolution and accuracy

Effective DOA Estimation Method for Sound Source Localization Using a Circular Microphone Array

Efficient Energy-based Orthogonal Matching Pursuit Algorithm for Multiple Sound Source Localization with Unknown Source Count

Temporal Characteristics of Azimuthally Moving Sound Source Localization in Patients with Mild and Moderate Sensorineural Hearing Loss

Fundamental study on sound source localization inside a structure using a deep neural network and computer-aided engineering

Export Citation Format

sound source localizationRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Active Binaural Auditory Perceptual System for a Socially Interactive Humanoid Robot

Drone audition: Sound source localization using on-board microphones

An Improved Multiple Sound Source Localization Method Using a Uniform Concentric Circular Microphone Array

Discrete-Time ZND Algorithms for Time-Dependent LQ Decomposition Applied to Sound Source Localization

Sound Source Localization Using a Convolutional Neural Network and Regression Model

Deep learning-based method for multiple sound source localization with high resolution and accuracy

Effective DOA Estimation Method for Sound Source Localization Using a Circular Microphone Array

Efficient Energy-based Orthogonal Matching Pursuit Algorithm for Multiple Sound Source Localization with Unknown Source Count

Temporal Characteristics of Azimuthally Moving Sound Source Localization in Patients with Mild and Moderate Sensorineural Hearing Loss

Fundamental study on sound source localization inside a structure using a deep neural network and computer-aided engineering

sound source localization
Recently Published Documents