Study on 3D Sound Source Visualization Using Frequency Domain Beamforming Method

The airport crash fire vehicle simulator is an important device for training airport firefighters; it offers the advantages of safety, energy savings, and reduced training cost. The sound system is an important part of the simulator: it provides voice messages to trainees and gives them a sense of immersion. Based on the principles of sound source localization, we analyze how the physical structure of the human body (head, torso, pinna) affects sound localization, establish a head-related transfer function (HRTF) model, and use the model to filter and delay the source signal so as to produce a virtual sound source. The approach was tested and verified with a listening experiment.

Analysis of cerebral blood flow during sound source direction recognition in virtual 3D sound technology

Procedia Computer Science ◽

10.1016/j.procs.2021.09.262 ◽

2021 ◽

Vol 192 ◽

pp. 4837-4844

Author(s):

Kazuki Nagaosa ◽

Hirokazu Miura ◽

Hirokazu Taki

Keyword(s):

Blood Flow ◽

Cerebral Blood Flow ◽

Sound Source ◽

3D Sound ◽

Sound Technology

A multichannel learning-based approach for sound source separation in reverberant environments

EURASIP Journal on Audio Speech and Music Processing ◽

10.1186/s13636-021-00227-2 ◽

2021 ◽

Vol 2021 (1) ◽

Author(s):

You-Siang Chen ◽

Zi-Jie Lin ◽

Mingsian R. Bai

Keyword(s):

Frequency Domain ◽

Sound Source ◽

Signal To Noise Ratio ◽

Source Separation ◽

Objective Evaluation ◽

Linear Mapping ◽

Invariant Mean ◽

Scale Invariant ◽

Sound Source Separation ◽

Reverberant Field

Abstract: In this paper, a multichannel learning-based network is proposed for sound source separation in a reverberant field. The network can be divided into two parts according to the training strategies. In the first stage, time-dilated convolutional blocks are trained to estimate the array weights for beamforming the multichannel microphone signals. Next, the output of the network is processed by a weight-and-sum operation that is reformulated to handle real-valued data in the frequency domain. In the second stage, a U-net model is concatenated to the beamforming network to serve as a non-linear mapping filter for joint separation and dereverberation. The scale-invariant mean square error (SI-MSE), a frequency-domain modification of the scale-invariant signal-to-noise ratio (SI-SNR), is used as the objective function for training. Furthermore, the combined network is trained with speech segments filtered by a wide variety of room impulse responses. Simulations are conducted for comprehensive multisource scenarios covering various subtending angles of sources and reverberation times. The proposed network is compared with several baseline approaches in terms of objective evaluation metrics. The results demonstrate the excellent performance of the proposed network in dereverberation and separation compared with the baseline methods.
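For readers unfamiliar with the SI-SNR mentioned in this abstract, the following is a minimal sketch of the commonly used time-domain definition. It is not the authors' SI-MSE objective, whose frequency-domain form is not given in the abstract; the function name `si_snr` and the `eps` stabilizer are assumptions made for illustration, and plain Python is used only to keep the sketch self-contained.

```python
import math


def si_snr(estimate, target, eps=1e-8):
    """Scale-invariant SNR in dB between two equal-length 1-D signals.

    Hypothetical helper for illustration; `eps` is an assumed small
    constant that avoids division by zero and log of zero.
    """
    if len(estimate) != len(target):
        raise ValueError("signals must have the same length")
    n = len(target)

    # Remove the mean so a DC offset does not affect the score.
    mean_e = sum(estimate) / n
    mean_t = sum(target) / n
    e = [x - mean_e for x in estimate]
    t = [x - mean_t for x in target]

    # Project the estimate onto the target to get the "target" component.
    dot = sum(a * b for a, b in zip(e, t))
    t_energy = sum(b * b for b in t) + eps
    scale = dot / t_energy
    s_target = [scale * b for b in t]

    # Whatever is left after the projection is treated as noise.
    e_noise = [a - s for a, s in zip(e, s_target)]

    num = sum(s * s for s in s_target)
    den = sum(x * x for x in e_noise) + eps
    return 10.0 * math.log10(num / den + eps)
```

Because the estimate is projected onto the target before the ratio is taken, rescaling the estimate leaves the score essentially unchanged, which is the property the name refers to.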

Blind 3D sound source direction using stereo microphones based on time-delay estimation and polar-pattern histogram

2017 2nd International Conference on Information Technology (INCIT) ◽

10.1109/incit.2017.8257881 ◽

2017 ◽

Author(s):

Naruephorn Tengtrairat ◽

Wai Lok Woo

Keyword(s):

Time Delay ◽

Sound Source ◽

Time Delay Estimation ◽

Delay Estimation ◽

3D Sound ◽

Pattern Histogram

Probabilistic 3D sound source mapping using moving microphone array

2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) ◽

10.1109/iros.2016.7759214 ◽

2016 ◽

Cited By ~ 2

Author(s):

Yoko Sasaki ◽

Ryo Tanabe ◽

Hiroshi Takemura

Keyword(s):

Sound Source ◽

Microphone Array ◽

3D Sound

3D Sound Source Localization System Based on Learning of Binaural Hearing

2005 IEEE International Conference on Systems, Man and Cybernetics ◽

10.1109/icsmc.2005.1571695 ◽

2006 ◽

Cited By ~ 26

Author(s):

H. Nakashima ◽

T. Mukai

Keyword(s):

Source Localization ◽

Sound Source ◽

Binaural Hearing ◽

Sound Source Localization ◽

Localization System ◽

3D Sound

Binaural Source Localization and Spatial Audio Reproduction for Telepresence Applications

Presence Teleoperators & Virtual Environments ◽

10.1162/pres.16.5.509 ◽

2007 ◽

Vol 16 (5) ◽

pp. 509-522 ◽

Cited By ~ 14

Author(s):

Fakheredine Keyrouz ◽

Klaus Diepold

Keyword(s):

Source Localization ◽

Sound Source ◽

Transfer Functions ◽

Superior Performance ◽

High Fidelity ◽

Spatial Audio ◽

3D Sound ◽

Perceptual Presence ◽

Remote Environment ◽

Audio Reproduction

Telepresence is generally described as the feeling of being immersed in a remote environment, be it virtual or real. A multimodal telepresence environment, equipped with modalities such as vision, audition, and haptics, improves immersion and augments the overall perceptual presence. The present work focuses on acoustic telepresence at both the teleoperator and operator sites. On the teleoperator side, we build a novel binaural sound source localizer using generic Head Related Transfer Functions (HRTFs). This new localizer provides estimates for the direction of a single sound source, given in terms of azimuth and elevation angles in free space, by using only two microphones. It also uses an algorithm that is efficient compared to the currently known algorithms used in similar localization processes. On the operator side, the paper addresses the problem of spatially interpolating HRTFs for densely sampled high-fidelity 3D sound synthesis. In our telepresence application scenario, the synthesized 3D sound is presented to the operator over headphones and is intended to achieve high-fidelity acoustic immersion. Using measured HRTF data, we create interpolated HRTFs between the existing functions using a matrix-valued interpolation function. The comparison with existing interpolation methods reveals that our new method offers superior performance and is capable of achieving high-fidelity reconstructions of HRTFs.
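As a point of reference for the interpolation problem described in this abstract, the sketch below shows plain linear interpolation between two measured head-related impulse responses at neighbouring azimuths. This is a simple baseline swapped in for illustration, not the matrix-valued interpolation function the authors propose; the function name `interpolate_hrir` and its parameters are hypothetical.

```python
def interpolate_hrir(hrir_a, hrir_b, az_a, az_b, az):
    """Linearly interpolate two impulse responses measured at az_a and az_b.

    Hypothetical baseline for illustration: returns the sample-wise
    weighted average for a target azimuth `az` with az_a <= az <= az_b.
    """
    if len(hrir_a) != len(hrir_b):
        raise ValueError("impulse responses must have the same length")
    if az_a >= az_b or not (az_a <= az <= az_b):
        raise ValueError("require az_a < az_b and az_a <= az <= az_b")

    # Weight is 0 at az_a and 1 at az_b.
    w = (az - az_a) / (az_b - az_a)
    return [(1.0 - w) * a + w * b for a, b in zip(hrir_a, hrir_b)]
```

Sample-wise averaging of this kind ignores differences in onset delay between the two measurements, which is one reason more elaborate interpolation schemes are studied.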

Real time robot audition system incorporating both 3D sound source localisation and voice characterisation

Proceedings 2007 IEEE International Conference on Robotics and Automation ◽

10.1109/robot.2007.364208 ◽

2007 ◽

Cited By ~ 4

Author(s):

Ben Rudzyn ◽

Waleed Kadous ◽

Claude Sammut

Keyword(s):

Real Time ◽

Sound Source ◽

Source Localisation ◽

3D Sound ◽

Robot Audition
