A new time-frequency representation for music signal analysis: Resonator time-frequency image

Author(s):  
Ruohua Zhou ◽  
Marco Mattavelli
2015 ◽  
Vol 2015 ◽  
pp. 1-13 ◽  
Author(s):  
Josefin Starkhammar ◽  
Maria Hansson-Sandsten

Time-frequency representation algorithms such as spectrograms have proven to be useful tools in marine biosonar signal analysis. Although there are several different time-frequency representation algorithms designed for different types of signals with various characteristics, it is unclear which algorithms that are best suited for transient signals, like the echolocation signals of echolocating whales. This paper describes a comparison of seven different time-frequency representation algorithms with respect to their usefulness when it comes to marine biosonar signals. It also provides the answer to how close in time and frequency two transients can be while remaining distinguishable as two separate signals in time-frequency representations. This is, for instance, relevant in studies where echolocation signal component azimuths are compared in the search for the exact location of their acoustic sources. The smallest time difference was found to be 20 µs and the smallest frequency difference 49 kHz of signals with a −3 dB bandwidth of 40 kHz. Among the tested methods, the Reassigned Smoothed Pseudo Wigner-Ville distribution technique was found to be the most capable of localizing closely spaced signal components.


2020 ◽  
Vol 87 (s1) ◽  
pp. s62-s67
Author(s):  
Markus Schwabe ◽  
Omar Elaiashy ◽  
Fernando Puente León

AbstractTime-dependent estimation of playing instruments in music recordings is an important preprocessing for several music signal processing algorithms. In this approach, instrument recognition is realized by neural networks with a two-dimensional input of short-time Fourier transform (STFT) magnitudes and a time-frequency representation based on phase information. The modified group delay (MODGD) function and the product spectrum (PS), which is based on MODGD, are analysed as phase representations. Training and evaluation processes are executed based on the MusicNet dataset. By the incorporation of PS in the input, instrument recognition can be improved about 2% in F1-score.


2021 ◽  
Vol 88 (5) ◽  
pp. 274-281
Author(s):  
Markus Schwabe ◽  
Michael Heizmann

Abstract An important preprocessing step for several music signal processing algorithms is the estimation of playing instruments in music recordings. To this aim, time-dependent instrument recognition is realized by a neural network with residual blocks in this approach. Since music signal processing tasks use diverse time-frequency representations as input matrices, the influence of different input representations for instrument recognition is analyzed in this work. Three-dimensional inputs of short-time Fourier transform (STFT) magnitudes and an additional time-frequency representation based on phase information are investigated as well as two-dimensional STFT or constant-Q transform (CQT) magnitudes. As additional phase representations, the product spectrum (PS), based on the modified group delay, and the frequency error (FE) matrix, related to the instantaneous frequency, are used. Training and evaluation processes are executed based on the MusicNet dataset, which enables the estimation of seven instruments. With a higher number of frequency bins in the input representations, an improved instrument recognition of about 2 % in F1-score can be achieved. Compared to the literature, frame-level instrument recognition can be improved for different input representations.


2004 ◽  
Vol 270-273 ◽  
pp. 209-214
Author(s):  
C.K. Lee ◽  
Dai Bum Cha ◽  
Jung Taek Kim ◽  
Joo Sun Kim ◽  
Sang J. Lee ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document