scholarly journals A neural network approach for sound event detection in real life audio

Author(s):  
Michele Valenti ◽  
Dario Tonelli ◽  
Fabio Vesperini ◽  
Emanuele Principi ◽  
Stefano Squartini
Author(s):  
Gianmarco Cerutti ◽  
Rahul Prasad ◽  
Alessio Brutti ◽  
Elisabetta Farella

2021 ◽  
Author(s):  
Janek Ebbers ◽  
Reinhold Haeb-Umbach

In this paper we present our system for thedetection and classi-fication of acoustic scenes and events (DCASE) 2020 ChallengeTask 4: Sound event detection and separation in domestic envi-ronments. We introduce two new models: the forward-backwardconvolutional recurrent neural network (FBCRNN) and the tag-conditioned convolutional neural network (CNN). The FBCRNNemploys two recurrent neural network (RNN) classifiers sharing thesame CNN for preprocessing. With one RNN processing a record-ing in forward direction and the other in backward direction, thetwo networks are trained to jointly predict audio tags, i.e., weak la-bels, at each time step within a recording, given that at each timestep they have jointly processed the whole recording. The pro-posed training encourages the classifiers to tag events as soon aspossible. Therefore, after training, the networks can be appliedto shorter audio segments of, e.g.,200 ms, allowing sound eventdetection (SED). Further, we propose a tag-conditioned CNN tocomplement SED. It is trained to predict strong labels while using(predicted) tags, i.e., weak labels, as additional input. For train-ing pseudo strong labels from a FBCRNN ensemble are used. Thepresented system scored the fourth and third place in the systemsand teams rankings, respectively. Subsequent improvements allowour system to even outperform the challenge baseline and winnersystems in average by, respectively,18.0 %and2.2 %event-basedF1-score on the validation set. Source code is publicly available athttps://github.com/fgnt/pb_sed


IEEE Access ◽  
2020 ◽  
Vol 8 ◽  
pp. 147337-147348
Author(s):  
Keming Zhang ◽  
Yuanwen Cai ◽  
Yuan Ren ◽  
Ruida Ye ◽  
Liang He

Sign in / Sign up

Export Citation Format

Share Document