A Speech Enhancement Method Using Attention Mechanism and Gated Recurrent Unit

Mapping Intimacies ◽

10.1109/iai53119.2021.9619422 ◽

2021 ◽

Author(s):

Kaibei Peng ◽

Xiaoming Sun ◽

Haowei Chen ◽

Zhen He ◽

Jianrong Wang

Keyword(s):

Speech Enhancement ◽

Attention Mechanism ◽

Enhancement Method ◽

Gated Recurrent Unit

Download Full-text

Speech enhancement from fused features based on deep neural network and gated recurrent unit network

EURASIP Journal on Advances in Signal Processing ◽

10.1186/s13634-021-00813-8 ◽

2021 ◽

Vol 2021 (1) ◽

Author(s):

Youming Wang ◽

Jiali Han ◽

Tianqi Zhang ◽

Didi Qing

Keyword(s):

Neural Network ◽

Deep Learning ◽

Power Spectrum ◽

Speech Enhancement ◽

Deep Neural Network ◽

Series Data ◽

Context Information ◽

Noisy Speech ◽

Enhancement Method ◽

Gated Recurrent Unit

AbstractSpeech is easily interfered by external environment in reality, which results in the loss of important features. Deep learning has become a popular speech enhancement method because of its superior potential in solving nonlinear mapping problems for complex features. However, the deficiency of traditional deep learning methods is the weak learning capability of important information from previous time steps and long-term event dependencies between the time-series data. To overcome this problem, we propose a novel speech enhancement method based on the fused features of deep neural networks (DNNs) and gated recurrent unit (GRU). The proposed method uses GRU to reduce the number of parameters of DNNs and acquire the context information of the speech, which improves the enhanced speech quality and intelligibility. Firstly, DNN with multiple hidden layers is used to learn the mapping relationship between the logarithmic power spectrum (LPS) features of noisy speech and clean speech. Secondly, the LPS feature of the deep neural network is fused with the noisy speech as the input of GRU network to compensate the missing context information. Finally, GRU network is performed to learn the mapping relationship between LPS features and log power spectrum features of clean speech spectrum. The proposed model is experimentally compared with traditional speech enhancement models, including DNN, CNN, LSTM and GRU. Experimental results demonstrate that the PESQ, SSNR and STOI of the proposed algorithm are improved by 30.72%, 39.84% and 5.53%, respectively, compared with the noise signal under the condition of matched noise. Under the condition of unmatched noise, the PESQ and STOI of the algorithm are improved by 23.8% and 37.36%, respectively. The advantage of the proposed method is that it uses the key information of features to suppress noise in both matched and unmatched noise cases and the proposed method outperforms other common methods in speech enhancement.

Download Full-text

Adaptive Single-Channel Speech Enhancement Method for a Push-To-Talk Enabled Wireless Communication Device

IEICE Transactions on Communications ◽

10.1587/transcom.2015ccp0023 ◽

2016 ◽

Vol E99.B (8) ◽

pp. 1745-1753

Author(s):

Hyoung-Gook KIM ◽

Jin Young KIM

Keyword(s):

Wireless Communication ◽

Speech Enhancement ◽

Single Channel ◽

Communication Device ◽

Enhancement Method

Download Full-text

Audio-Visual Speech Enhancement Method Conditioned in the Lip Motion and Speaker-Discriminative Embeddings

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414133 ◽

2021 ◽

Author(s):

Koichiro Ito ◽

Masaaki Yamamoto ◽

Kenji Nagamatsu

Keyword(s):

Speech Enhancement ◽

Visual Speech ◽

Enhancement Method

Download Full-text

A single channel speech enhancement method based on masking properties and minimum statistics

6th International Conference on Signal Processing, 2002. ◽

10.1109/icosp.2002.1181091 ◽

2003 ◽

Author(s):

Jiang Xiaoping ◽

Fu Hua ◽

Yao Tianren

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Enhancement Method ◽

Minimum Statistics

Download Full-text

An Optimization Adaptive BWT Speech Enhancement Method

Information Technology Journal ◽

10.3923/itj.2014.1730.1736 ◽

2014 ◽

Vol 13 (10) ◽

pp. 1730-1736 ◽

Author(s):

Cao Bin-Fang ◽

Li Jian-Qi ◽

Qu Peixin ◽

Peng Guang-Han

Keyword(s):

Speech Enhancement ◽

Enhancement Method

Download Full-text

Low-Illumination Image Enhancement Method Based on Attention Mechanism and Retinex

Laser & Optoelectronics Progress ◽

10.3788/lop57.201004 ◽

2020 ◽

Vol 57 (20) ◽

pp. 201004

Author(s):

黄辉先 Huang Huixian ◽

陈凡浩 Chen Fanhao

Keyword(s):

Image Enhancement ◽

Attention Mechanism ◽

Enhancement Method ◽

Low Illumination

Download Full-text

Medical image enhancement method based on visual attention mechanism

2018 Chinese Automation Congress (CAC) ◽

10.1109/cac.2018.8623071 ◽

2018 ◽

Author(s):

Ning Li ◽

Jianyu Zhao ◽

Ping Jiang ◽

Chunmei Li

Keyword(s):

Visual Attention ◽

Image Enhancement ◽

Medical Image ◽

Attention Mechanism ◽

Enhancement Method ◽

Visual Attention Mechanism ◽

Medical Image Enhancement

Download Full-text

Short-term Load Forecasting Model Based on Attention Mechanism and Gated Recurrent Unit

2019 IEEE 8th International Conference on Advanced Power System Automation and Protection (APAP) ◽

10.1109/apap47170.2019.9225191 ◽

2019 ◽

Author(s):

Song Liu ◽

Pin Lv

Keyword(s):

Load Forecasting ◽

Attention Mechanism ◽

Forecasting Model ◽

Model Based ◽

Short Term Load Forecasting ◽

Gated Recurrent Unit

Download Full-text

A Cepstrum Domain HMM-Based Speech Enhancement Method Applied to Non-Stationary Noise

Multimedia Systems and Applications Series - Signal Processing for Telecommunications and Multimedia ◽

10.1007/0-387-22928-0_1 ◽

2005 ◽

pp. 1-13

Author(s):

Mikael Nilsson ◽

Mattias Dahl ◽

Ingvar Claesson

Keyword(s):

Speech Enhancement ◽

Stationary Noise ◽

Enhancement Method

Download Full-text

A New Microphone Array Speech Enhancement Method Based on AR Model

Lecture Notes in Computer Science - Life System Modeling and Intelligent Computing ◽

10.1007/978-3-642-15615-1_17 ◽

2010 ◽

pp. 139-147

Author(s):

Liyan Zhang ◽

Fuliang Yin ◽

Lijun Zhang

Keyword(s):

Speech Enhancement ◽

Microphone Array ◽

Ar Model ◽

Enhancement Method

Download Full-text