Semi-Supervised Singing Voice Separation With Noisy Self-Training

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9413723 ◽

2021 ◽

Author(s):

Zhepei Wang ◽

Ritwik Giri ◽

Umut Isik ◽

Jean-Marc Valin ◽

Arvindh Krishnaswamy

Keyword(s):

Singing Voice ◽

Singing Voice Separation

Download Full-text

Singing Voice Separation and Vocal F0 Estimation Based on Mutual Combination of Robust Principal Component Analysis and Subharmonic Summation

IEEE/ACM Transactions on Audio Speech and Language Processing ◽

10.1109/taslp.2016.2577879 ◽

2016 ◽

Vol 24 (11) ◽

pp. 2084-2095 ◽

Author(s):

Yukara Ikemiya ◽

Katsutoshi Itoyama ◽

Kazuyoshi Yoshii

Keyword(s):

Principal Component Analysis ◽

Principal Component ◽

Component Analysis ◽

Singing Voice ◽

Robust Principal Component Analysis ◽

Singing Voice Separation

Download Full-text

A recurrent encoder-decoder approach with skip-filtering connections for monaural singing voice separation

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) ◽

10.1109/mlsp.2017.8168117 ◽

2017 ◽

Author(s):

Stylianos Ioannis Mimilakis ◽

Konstantinos Drossos ◽

Tuomas Virtanen ◽

Gerald Schuller

Keyword(s):

Singing Voice ◽

Singing Voice Separation

Download Full-text

Multi-Stage Non-Negative Matrix Factorization for Monaural Singing Voice Separation

IEEE Transactions on Audio Speech and Language Processing ◽

10.1109/tasl.2013.2266773 ◽

2013 ◽

Vol 21 (10) ◽

pp. 2096-2107 ◽

Author(s):

Bilei Zhu ◽

Wei Li ◽

Ruijiang Li ◽

Xiangyang Xue

Keyword(s):

Matrix Factorization ◽

Singing Voice ◽

Multi Stage ◽

Singing Voice Separation ◽

Non Negative Matrix Factorization

Download Full-text

Monaural Singing Voice Separation with Skip-Filtering Connections and Recurrent Inference of Time-Frequency Mask

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2018.8461822 ◽

2018 ◽

Author(s):

Stylianos Ioannis Mimilakis ◽

Konstantinos Drossos ◽

Joao F. Santos ◽

Gerald Schuller ◽

Tuomas Virtanen ◽

...

Keyword(s):

Singing Voice ◽

Time Frequency ◽

Singing Voice Separation

Download Full-text

Monaural Singing Voice Separation Using Fusion-Net with Time-Frequency Masking

2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) ◽

10.1109/apsipaasc47483.2019.9023055 ◽

2019 ◽

Author(s):

Feng Li ◽

Kaizhi Qian ◽

Mark Hasegawa-Johnson ◽

Masato Akagi

Keyword(s):

Singing Voice ◽

Time Frequency ◽

Singing Voice Separation

Download Full-text

Improving Singing Voice Separation Using Curriculum Learning on Recurrent Neural Networks

Applied Sciences ◽

10.3390/app10072465 ◽

2020 ◽

Vol 10 (7) ◽

pp. 2465

Author(s):

Seungtae Kang ◽

Jeong-Sik Park ◽

Gil-Jin Jang

Keyword(s):

Single Channel ◽

The Other ◽

Superior Performance ◽

Difficulty Level ◽

Difficult Case ◽

Single Source ◽

Singing Voice ◽

Learning Framework ◽

Easy Case ◽

Singing Voice Separation

Single-channel singing voice separation has been considered a difficult task, as it requires predicting two different audio sources independently from mixed vocal and instrument sounds recorded by a single microphone. We propose a new singing voice separation approach based on the curriculum learning framework, in which learning is started with only easy examples and then task difficulty is gradually increased. In this study, we regard the data providing obviously dominant characteristics of a single source as an easy case and the other data as a difficult case. To quantify the dominance property between two sources, we define a dominance factor that determines a difficulty level according to relative intensity between vocal sound and instrument sound. If a given data is determined to provide obviously dominant characteristics of a single source according to the factor, it is regarded as an easy case; otherwise, it belongs to a difficult case. Early stages in the learning focus on easy cases, thus allowing rapidly learning overall characteristics of each source. On the other hand, later stages handle difficult cases, allowing more careful and sophisticated learning. In experiments conducted on three song datasets, the proposed approach demonstrated superior performance compared to the conventional approaches.

Download Full-text

Improving Singing Voice Separation Using Attribute-Aware Deep Network

2019 International Workshop on Multilayer Music Representation and Processing (MMRP) ◽

10.1109/mmrp.2019.00019 ◽

2019 ◽

Author(s):

Rupak Vignesh Swaminathan ◽

Alexander Lerch

Keyword(s):

Singing Voice ◽

Deep Network ◽

Singing Voice Separation

Download Full-text

Combining F0 and non-negative constraint robust principal component analysis for singing voice separation

Signal Processing ◽

10.1016/j.sigpro.2019.107432 ◽

2020 ◽

Vol 170 ◽

pp. 107432

Author(s):

Feng Li ◽

Masato Akagi

Keyword(s):

Principal Component Analysis ◽

Principal Component ◽

Component Analysis ◽

Singing Voice ◽

Robust Principal Component Analysis ◽

Singing Voice Separation

Download Full-text

Unsupervised Singing Voice Separation Using Gammatone Auditory Filterbank and Constraint Robust Principal Component Analysis

2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) ◽

10.23919/apsipa.2018.8659640 ◽

2018 ◽

Author(s):

Feng Li ◽

Masato Akagi

Keyword(s):

Principal Component Analysis ◽

Principal Component ◽

Component Analysis ◽

Singing Voice ◽

Robust Principal Component Analysis ◽

Singing Voice Separation

Download Full-text

Improving Singing Voice Separation with the Wave-U-Net Using Minimum Hyperspherical Energy

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp40776.2020.9053424 ◽

2020 ◽

Author(s):

Joaquin Perez-Lapillo ◽

Oleksandr Galkin ◽

Tillman Weyde

Keyword(s):

Singing Voice ◽

Singing Voice Separation

Download Full-text