Permutation invariant training of deep models for speaker-independent multi-talker speech separation

Author(s):  
Dong Yu ◽  
Morten Kolbaek ◽  
Zheng-Hua Tan ◽  
Jesper Jensen

Author(s):  
Mandar Gogate ◽  
Ahsan Adeel ◽  
Ricard Marxer ◽  
Jon Barker ◽  
Amir Hussain

Author(s):  
Jing Shi ◽  
Jiaming Xu ◽  
Guangcan Liu ◽  
Bo Xu

Recent deep learning methods have made significant progress in multi-talker mixed speech separation. However, most existing models separate all the speech channels rather than selectively attending to the target one. As a result, those frameworks may fail to offer a satisfactory solution in complex auditory scenes, where the number of input sounds is usually uncertain and may even change dynamically. In this paper, we present a novel neural-network-based structure motivated by the top-down attention humans exhibit when facing a complicated acoustic scene. Unlike previous works, our method constructs an inference-attention structure that predicts the candidates of interest and extracts the speech channel of each one. Our approach removes the requirement that the number of channels be given in advance and avoids the high computational complexity of the label permutation problem. We evaluated our model on the WSJ0 mixed-speech tasks. In all experiments, our model is highly competitive, matching and even outperforming the baselines.
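
The label permutation problem mentioned above arises because, during training, there is no fixed assignment between estimated and reference speaker channels. As a rough, hypothetical illustration (not the code of either paper), the sketch below shows a permutation invariant training (PIT) style loss in the spirit of the titled work: it scores every assignment of estimates to references and keeps the cheapest one, so the cost grows factorially with the number of speakers, which is the overhead the abstract refers to. The function name pit_mse_loss and the array shapes are assumptions made for this example.

```python
import itertools
import numpy as np

def pit_mse_loss(estimates, references):
    """Permutation invariant training (PIT) loss sketch.

    Computes the MSE for every assignment of estimated channels to
    reference channels and keeps the best one. estimates and references
    are arrays of shape (num_speakers, num_frames, num_bins). The
    exhaustive search costs num_speakers! evaluations.
    """
    num_speakers = estimates.shape[0]
    best_loss, best_perm = None, None
    for perm in itertools.permutations(range(num_speakers)):
        # Mean squared error under this particular speaker assignment.
        loss = np.mean((estimates[list(perm)] - references) ** 2)
        if best_loss is None or loss < best_loss:
            best_loss, best_perm = loss, perm
    return best_loss, best_perm

# Toy usage: two estimated channels matched against two references.
rng = np.random.default_rng(0)
refs = rng.standard_normal((2, 100, 129))
ests = refs[::-1] + 0.01 * rng.standard_normal((2, 100, 129))  # swapped order
loss, perm = pit_mse_loss(ests, refs)
print(perm)  # (1, 0): PIT recovers the correct assignment despite the swap
```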

