scholarly journals Attention-Based Joint Training of Noise Suppression and Sound Event Detection for Noise-Robust Classification

Sensors ◽  
2021 ◽  
Vol 21 (20) ◽  
pp. 6718
Author(s):  
Jin-Young Son ◽  
Joon-Hyuk Chang

Sound event detection (SED) recognizes the corresponding sound event of an incoming signal and estimates its temporal boundary. Although SED has been recently developed and used in various fields, achieving noise-robust SED in a real environment is typically challenging owing to the performance degradation due to ambient noise. In this paper, we propose combining a pretrained time-domain speech-separation-based noise suppression network (NS) and a pretrained classification network to improve the SED performance in real noisy environments. We use group communication with a context codec method (GC3)-equipped temporal convolutional network (TCN) for the noise suppression model and a convolutional recurrent neural network for the SED model. The former significantly reduce the model complexity while maintaining the same TCN module and performance as a fully convolutional time-domain audio separation network (Conv-TasNet). We also do not update the weights of some layers (i.e., freeze) in the joint fine-tuning process and add an attention module in the SED model to further improve the performance and prevent overfitting. We evaluate our proposed method using both simulation and real recorded datasets. The experimental results show that our method improves the classification performance in a noisy environment under various signal-to-noise-ratio conditions.

2020 ◽  
Author(s):  
Xu Zheng ◽  
Yan Song ◽  
Jie Yan ◽  
Li-Rong Dai ◽  
Ian McLoughlin ◽  
...  

Author(s):  
Gianmarco Cerutti ◽  
Rahul Prasad ◽  
Alessio Brutti ◽  
Elisabetta Farella

2020 ◽  
Vol 4 (3) ◽  
pp. 20 ◽  
Author(s):  
Giuseppe Ciaburro

Parking is a crucial element in urban mobility management. The availability of parking areas makes it easier to use a service, determining its success. Proper parking management allows economic operators located nearby to increase their business revenue. Underground parking areas during off-peak hours are uncrowded places, where user safety is guaranteed by company overseers. Due to the large size, ensuring adequate surveillance would require many operators to increase the costs of parking fees. To reduce costs, video surveillance systems are used, in which an operator monitors many areas. However, some activities are beyond the control of this technology. In this work, a procedure to identify sound events in an underground garage is developed. The aim of the work is to detect sounds identifying dangerous situations and to activate an automatic alert that draws the attention of surveillance in that area. To do this, the sounds of a parking sector were detected with the use of sound sensors. These sounds were analyzed by a sound detector based on convolutional neural networks. The procedure returned high accuracy in identifying a car crash in an underground parking area.


2020 ◽  
Author(s):  
Liujun zhang ◽  
Liyan Luo ◽  
Mei Wang ◽  
Xiyu Song ◽  
Shuting Guo ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document