sound separation
Recently Published Documents


TOTAL DOCUMENTS

84
(FIVE YEARS 24)

H-INDEX

12
(FIVE YEARS 3)

Author(s):  
Yukai Gong ◽  
Longquan Dai

Monaural music sound separation isolates individual instrument sources from a mono-channel polyphonic mixture. The primary challenge is to separate the source partials overlapped in time-frequency regions, especially for the full overlapping cases that at least one source does not have any nonoverlapping partial. Due to the lack of effective methods to separate the sources with full overlapping partials, this paper put forward a relaxed extended common amplitude modulation (RECAM) approach to deal with the octave sound separation, one of the most difficult cases. Our strategy uses a multi-band co-processing way for each short-time partial wave segment. Extensive experiments are conducted on octave mixture samples drawn from the Iowa University Musical Instrument Database. Results confirm that our RECAM achieves the best separation performance. For nonvibrato and vibrato mixtures, the average improvement of RECAM in each measure exceeds [Formula: see text]dB and [Formula: see text]dB, respectively.


2021 ◽  
Author(s):  
Ethan Grooby ◽  
Jinyuan He ◽  
Davood Fattahi ◽  
Lindsay Zhou ◽  
Arrabella King ◽  
...  
Keyword(s):  

Author(s):  
Scott Wisdom ◽  
Aren Jansen ◽  
Ron J. Weiss ◽  
Hakan Erdogan ◽  
John R. Hershey
Keyword(s):  

Author(s):  
Panagiotis Giannoulis ◽  
Gerasimos Potamianos ◽  
Petros Maragos

2021 ◽  
Vol 263 (4) ◽  
pp. 2044-2051
Author(s):  
Han Li ◽  
Kean Chen ◽  
Bernhard U. Seeber

Noise pollution has become a growing concern in public health. The availability of low-cost wireless acoustic sensor networks permits continuous monitoring of noise. However, real acoustic scenes are composed of irrelevant sources (anomalous noise) that overlap with monitored noise, causing biased evaluation and controversy. One classical scene is selected in our study. For road traffic noise assessment, other possible non-traffic noise (e.g., speech, thunder) should be excluded to obtain a reliable evaluation. Because anomalous noise is diverse, occasional, and unpredictable in real-life scenes, removing it from the mixture is a challenge. We explore a fully convolutional time-domain audio separation network (ConvTasNet) for arbitrary sound separation. ConvTasNet is trained by a large dataset, including environmental sounds, speech, and music over 150 hours. After training, the scale-invariant signal-to-distortion ratio (SI-SDR) is improved by 11.40 dB on average for an independent test dataset. ConvTasNet is next applied to anomalous noise separation of traffic noise scenes. We mix traffic noise and anomalous noise at random SNR between -10 dB to 0 dB. Separation is especially effective for salient and long-term anomalous noise, which smooth the overall sound pressure level curve over time. Results emphasize the importance of anomalous noise separation for reliable evaluation.


2021 ◽  
Author(s):  
Alexander Iliev ◽  
Mayank Dewli ◽  
Muhsin Kalkan ◽  
Preeti Prakash Kudva ◽  
Rekha Turkar

Author(s):  
Scott Wisdom ◽  
Hakan Erdogan ◽  
Daniel P. W. Ellis ◽  
Romain Serizel ◽  
Nicolas Turpault ◽  
...  
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document