TasNet: Time-Domain Audio Separation Network for Real-Time, Single-Channel Speech Separation

Author(s):  
Yi Luo ◽  
Nima Mesgarani

Author(s):  
Yucheng Zhao ◽  
Chong Luo ◽  
Zheng-Jun Zha ◽  
Wenjun Zeng

In this paper, we introduce the Transformer to time-domain methods for single-channel speech separation. The Transformer has the potential to boost speech separation performance because of its strong sequence modeling capability. However, its computational complexity, which grows quadratically with sequence length, has made it largely inapplicable to speech applications. To tackle this issue, we propose a novel variant of the Transformer, named multi-scale group Transformer (MSGT). The key ideas are group self-attention, which significantly reduces the complexity, and multi-scale fusion, which retains the Transformer's ability to capture long-term dependencies. We implement two versions of MSGT with different complexities and apply them to a well-known time-domain speech separation method, Conv-TasNet. By simply replacing the original temporal convolutional network (TCN) with MSGT, our approach, called MSGT-TasNet, achieves a large gain over Conv-TasNet on both the WSJ0-2mix and WHAM! benchmarks. Without bells and whistles, the performance of MSGT-TasNet is already on par with state-of-the-art (SOTA) methods.
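As a rough illustration of the group self-attention idea, the sketch below applies scaled dot-product attention independently inside contiguous, non-overlapping groups of a sequence. This is a minimal sketch under stated assumptions, not the MSGT implementation: the abstract does not say how groups are formed, so the contiguous partition, the absence of learned projections and multi-scale fusion, and the names `attention`, `group_attention`, and `group_size` are all hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    dim = len(queries[0])
    out = []
    for q in queries:
        # One score per key: this inner loop is the quadratic part.
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in keys]
        weights = softmax(scores)
        out.append([sum(w * v[j] for w, v in zip(weights, values))
                    for j in range(len(values[0]))])
    return out


def group_attention(x, group_size):
    """Self-attention restricted to contiguous groups (assumed partition).

    No learned projections are used: queries, keys, and values are all `x`.
    """
    if group_size <= 0:
        raise ValueError("group_size must be positive")
    out = []
    for start in range(0, len(x), group_size):
        chunk = x[start:start + group_size]
        out.extend(attention(chunk, chunk, chunk))
    return out


if __name__ == "__main__":
    frames = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, -0.5]]
    print(group_attention(frames, group_size=2))
```

For a sequence of length L split into groups of size G, this computes roughly L·G attention scores instead of L², which is the kind of saving the abstract attributes to group self-attention. Positions in different groups never interact here, which is presumably the gap that the multi-scale fusion in MSGT is meant to close.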


Entropy ◽  
2021 ◽  
Vol 23 (1) ◽  
pp. 116
Author(s):  
Xiangfa Zhao ◽  
Guobing Sun

Automatic sleep staging with only one channel is a challenging problem in sleep-related research. In this paper, a simple and efficient method named PPG-based multi-class automatic sleep staging (PMSS) is proposed using only a photoplethysmography (PPG) signal. Single-channel PPG data were obtained from four categories of subjects in the CAP sleep database. After preprocessing of the PPG data, features were extracted from the time domain, frequency domain, and nonlinear domain, for a total of 21 features. Finally, the Light Gradient Boosting Machine (LightGBM) classifier was used for multi-class sleep staging. The accuracy of the multi-class automatic sleep staging was over 70%, and Cohen’s kappa statistic (κ) was over 0.6. This showed that the PMSS method can also be applied to sleep staging for patients with sleep disorders.
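For readers unfamiliar with the agreement metric reported above, the sketch below computes Cohen's kappa from two label sequences using the standard definition κ = (p_o − p_e) / (1 − p_e), where p_o is the observed agreement and p_e the agreement expected by chance. The function name `cohens_kappa` and the toy stage labels are illustrative assumptions; they are not data or code from the study.

```python
from collections import Counter


def cohens_kappa(true_labels, predicted_labels):
    """Cohen's kappa for two equal-length sequences of class labels."""
    if len(true_labels) != len(predicted_labels) or not true_labels:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(true_labels)

    # Observed agreement: fraction of epochs where both labels match.
    observed = sum(t == p for t, p in zip(true_labels, predicted_labels)) / n

    # Chance agreement: product of the marginal class frequencies, summed over classes.
    true_counts = Counter(true_labels)
    pred_counts = Counter(predicted_labels)
    expected = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)

    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1.0 - expected)


if __name__ == "__main__":
    # Toy labels (hypothetical): W = wake, N = NREM, R = REM.
    scored = ["W", "W", "N", "N", "R", "R"]
    predicted = ["W", "N", "N", "N", "R", "W"]
    print(cohens_kappa(scored, predicted))
```

Unlike raw accuracy, kappa discounts agreement that would occur by chance given the class frequencies, which matters for sleep data where some stages dominate the recording.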


2014 ◽  
Vol 53 (7S) ◽  
pp. 07KC14 ◽  
Author(s):  
Tan Yiyu ◽  
Yasushi Inoguchi ◽  
Yukinori Sato ◽  
Makoto Otani ◽  
Yukio Iwaya ◽  
...  
