Incorporation of phase information for improved time-dependent instrument recognition
Keyword(s):
AbstractTime-dependent estimation of playing instruments in music recordings is an important preprocessing for several music signal processing algorithms. In this approach, instrument recognition is realized by neural networks with a two-dimensional input of short-time Fourier transform (STFT) magnitudes and a time-frequency representation based on phase information. The modified group delay (MODGD) function and the product spectrum (PS), which is based on MODGD, are analysed as phase representations. Training and evaluation processes are executed based on the MusicNet dataset. By the incorporation of PS in the input, instrument recognition can be improved about 2% in F1-score.
2015 ◽
Vol 12
(03)
◽
pp. 1550021
◽
2020 ◽
Vol 65
(4)
◽
pp. 379-391
◽
2017 ◽
Vol 2017
◽
pp. 1-14
◽
Keyword(s):