Knowledge Distillation for Singing Voice Detection

Mapping Intimacies ◽

10.21437/interspeech.2021-636 ◽

2021 ◽

Author(s):

Soumava Paul ◽

Gurunath Reddy M ◽

K. Sreenivasa Rao ◽

Partha Pratim Das

Keyword(s):

Singing Voice ◽

Knowledge Distillation ◽

Voice Detection

Download Full-text

A Practical Singing Voice Detection System Based on GRU-RNN

Lecture Notes in Electrical Engineering - Proceedings of the 6th Conference on Sound and Music Technology (CSMT) ◽

10.1007/978-981-13-8707-4_2 ◽

2019 ◽

pp. 15-25

Author(s):

Zhigao Chen ◽

Xulong Zhang ◽

Jin Deng ◽

Juanjuan Li ◽

Yiliang Jiang ◽

...

Keyword(s):

Detection System ◽

Singing Voice ◽

Voice Detection

Download Full-text

On fusion of timbre-motivated features for singing voice detection and singer identification

2008 IEEE International Conference on Acoustics, Speech and Signal Processing ◽

10.1109/icassp.2008.4518087 ◽

2008 ◽

Author(s):

Tin Lay Nwe ◽

Haizhou Li

Keyword(s):

Singing Voice ◽

Voice Detection ◽

Singer Identification

Download Full-text

Research on Singing Voice Detection Based on a Long-Term Recurrent Convolutional Network with Vocal Separation and Temporal Smoothing

Electronics ◽

10.3390/electronics9091458 ◽

2020 ◽

Vol 9 (9) ◽

pp. 1458

Author(s):

Xulong Zhang ◽

Yi Yu ◽

Yongwei Gao ◽

Xi Chen ◽

Wei Li

Keyword(s):

Time Domain ◽

Short Term Memory ◽

Detection Algorithm ◽

Singing Voice ◽

Convolutional Network ◽

Voice Detection ◽

Public Datasets

Singing voice detection or vocal detection is a classification task that determines whether a given audio segment contains singing voices. This task plays a very important role in vocal-related music information retrieval tasks, such as singer identification. Although humans can easily distinguish between singing and nonsinging parts, it is still very difficult for machines to do so. Most existing methods focus on audio feature engineering with classifiers, which rely on the experience of the algorithm designer. In recent years, deep learning has been widely used in computer hearing. To extract essential features that reflect the audio content and characterize the vocal context in the time domain, this study adopted a long-term recurrent convolutional network (LRCN) to realize vocal detection. The convolutional layer in LRCN functions in feature extraction, and the long short-term memory (LSTM) layer can learn the time sequence relationship. The preprocessing of singing voices and accompaniment separation and the postprocessing of time-domain smoothing were combined to form a complete system. Experiments on five public datasets investigated the impacts of the different features for the fusion, frame size, and block size on LRCN temporal relationship learning, and the effects of preprocessing and postprocessing on performance, and the results confirm that the proposed singing voice detection algorithm reached the state-of-the-art level on public datasets.

Download Full-text

Singing voice detection for karaoke application

Visual Communications and Image Processing 2005 ◽

10.1117/12.631645 ◽

2005 ◽

Author(s):

Arun Shenoy ◽

Yuansheng Wu ◽

Ye Wang

Keyword(s):

Singing Voice ◽

Voice Detection

Download Full-text

Singing voice detection in pop songs using co-training algorithm

2008 IEEE International Conference on Acoustics, Speech and Signal Processing ◽

10.1109/icassp.2008.4517938 ◽

2008 ◽

Author(s):

Swe Zin Kalayar Khine ◽

Tin Lay Nwe ◽

Haizhou Li

Keyword(s):

Training Algorithm ◽

Singing Voice ◽

Voice Detection

Download Full-text

Singing voice detection in music tracks using direct voice vibrato detection

2009 IEEE International Conference on Acoustics, Speech and Signal Processing ◽

10.1109/icassp.2009.4959926 ◽

2009 ◽

Author(s):

L. Regnier ◽

G. Peeters

Keyword(s):

Singing Voice ◽

Voice Detection

Download Full-text

Transfer Learning for Improving Singing-Voice Detection in Polyphonic Instrumental Music

10.21437/interspeech.2020-1806 ◽

2020 ◽

Author(s):

Yuanbo Hou ◽

Frank K. Soong ◽

Jian Luan ◽

Shengchen Li

Keyword(s):

Transfer Learning ◽

Instrumental Music ◽

Singing Voice ◽

Voice Detection

Download Full-text

Comparative study of singing voice detection based on deep neural networks and ensemble learning

Human-centric Computing and Information Sciences ◽

10.1186/s13673-018-0158-1 ◽

2018 ◽

Vol 8 (1) ◽

Author(s):

Shingchern D. You ◽

Chien-Hung Liu ◽

Woei-Kae Chen

Keyword(s):

Neural Networks ◽

Comparative Study ◽

Ensemble Learning ◽

Deep Neural Networks ◽

Singing Voice ◽

Voice Detection

Download Full-text

Context-Aware Features for Singing Voice Detection in Polyphonic Music

Adaptive Multimedia Retrieval. Large-Scale Multimedia Retrieval and Evaluation - Lecture Notes in Computer Science ◽

10.1007/978-3-642-37425-8_4 ◽

2013 ◽

pp. 43-57 ◽

Author(s):

Vishweshwara Rao ◽

Chitralekha Gupta ◽

Preeti Rao

Keyword(s):

Context Aware ◽

Singing Voice ◽

Voice Detection ◽

Polyphonic Music

Download Full-text

Singing Voice Detection Using Multi-Feature Deep Fusion with CNN

Lecture Notes in Electrical Engineering - Proceedings of the 7th Conference on Sound and Music Technology (CSMT) ◽

10.1007/978-981-15-2756-2_4 ◽

2019 ◽

pp. 41-52

Author(s):

Xulong Zhang ◽

Shengchen Li ◽

Zijin Li ◽

Shizhe Chen ◽

Yongwei Gao ◽

...

Keyword(s):

Singing Voice ◽

Voice Detection

Download Full-text