A Convolutional Network with Multi-Scale and Attention Mechanisms for End-to-End Single-Channel Speech Enhancement

2021, pp. 1-1
Author(s): Xiang Xiaoxiao, Zhang Xiaojuan, Chen Haozhe
Author(s): Yucheng Zhao, Chong Luo, Zheng-Jun Zha, Wenjun Zeng

In this paper, we introduce the Transformer to time-domain methods for single-channel speech separation. The Transformer has the potential to boost speech separation performance because of its strong sequence-modeling capability. However, its computational complexity, which grows quadratically with the sequence length, has made it largely inapplicable to speech applications. To tackle this issue, we propose a novel variation of the Transformer, named the multi-scale group Transformer (MSGT). The key ideas are group self-attention, which significantly reduces the complexity, and multi-scale fusion, which retains the Transformer's ability to capture long-term dependencies. We implement two versions of MSGT with different complexities and apply them to a well-known time-domain speech separation method called Conv-TasNet. By simply replacing the original temporal convolutional network (TCN) with MSGT, our approach, called MSGT-TasNet, achieves a large gain over Conv-TasNet on both the WSJ0-2mix and WHAM! benchmarks. Without bells and whistles, the performance of MSGT-TasNet is already on par with state-of-the-art (SOTA) methods.
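The two key ideas in the abstract lend themselves to a compact illustration. Below is a minimal PyTorch sketch, not the authors' implementation: the class names GroupSelfAttention and MultiScaleGroupBlock, the group sizes (50, 100, 200), the four attention heads, and the concatenate-and-project fusion are all illustrative assumptions. It shows how restricting self-attention to fixed-size groups makes the cost grow linearly with sequence length, and how branches run at several group sizes can be fused to recover longer-range context.

# A minimal sketch, not the authors' implementation: group self-attention restricts
# attention to fixed-size chunks (cost grows linearly with sequence length instead
# of quadratically), and multi-scale fusion merges branches run at several group
# sizes. Group sizes, head count, and the concat-and-project fusion are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


class GroupSelfAttention(nn.Module):
    """Self-attention applied independently within non-overlapping groups."""

    def __init__(self, dim: int, group_size: int, num_heads: int = 4):
        super().__init__()
        self.group_size = group_size
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, dim)
        b, t, d = x.shape
        g = self.group_size
        pad = (g - t % g) % g                                # pad so the length divides evenly
        x_pad = F.pad(x, (0, 0, 0, pad))
        chunks = x_pad.reshape(b * (t + pad) // g, g, d)     # fold each group into the batch
        attended, _ = self.attn(chunks, chunks, chunks)      # O(g^2) cost per group
        out = attended.reshape(b, t + pad, d)[:, :t]         # unfold and drop the padding
        return self.norm(x + out)                            # residual connection


class MultiScaleGroupBlock(nn.Module):
    """Run group self-attention at several group sizes and fuse the results."""

    def __init__(self, dim: int, group_sizes=(50, 100, 200)):
        super().__init__()
        self.branches = nn.ModuleList(
            [GroupSelfAttention(dim, g) for g in group_sizes]
        )
        self.fuse = nn.Linear(dim * len(group_sizes), dim)   # concat-and-project fusion

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        outs = [branch(x) for branch in self.branches]       # one branch per scale
        return self.fuse(torch.cat(outs, dim=-1))


if __name__ == "__main__":
    frames = torch.randn(2, 1000, 128)        # (batch, encoded time frames, feature dim)
    block = MultiScaleGroupBlock(dim=128)
    print(block(frames).shape)                # torch.Size([2, 1000, 128])

A stack of blocks like this could stand in for the TCN separator inside Conv-TasNet, which is the replacement the abstract describes; the exact depth, group sizes, and fusion rule would need to follow the paper itself.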


2021
Author(s): Yanmin Zhu, Xiang Zheng, Xinrong Wu, Wanning Liu, Lei Pi, ...
