speech separation Latest Research Papers

Exploring single channel speech separation for short-time text-dependent speaker verification

International Journal of Speech Technology ◽

10.1007/s10772-022-09959-8 ◽

2022 ◽

Author(s):

Jiangyu Han ◽

Yan Shi ◽

Yanhua Long ◽

Jiaen Liang

Keyword(s):

Single Channel ◽

Speaker Verification ◽

Speech Separation ◽

Short Time ◽

Text Dependent Speaker Verification

Role of Speech Separation in Verifying the Speaker Under Degraded Conditions Using EMD and Hilbert Transform

Algorithms for Intelligent Systems - Proceedings of the International Conference on Paradigms of Communication, Computing and Data Sciences ◽

10.1007/978-981-16-5747-4_27 ◽

2022 ◽

pp. 309-324

Author(s):

M. K. Prasanna Kumar ◽

R. Kumaraswamy

Keyword(s):

Hilbert Transform ◽

Speech Separation

Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain

IEEE/ACM Transactions on Audio Speech and Language Processing ◽

10.1109/taslp.2022.3140556 ◽

2022 ◽

pp. 1-1

Author(s):

Zengwei Yao ◽

Wenjie Pei ◽

Fanglin Chen ◽

Guangming Lu ◽

David Zhang

Keyword(s):

High Order ◽

Speech Separation ◽

Fine Grained

Enhancing the correlation between the quality and intelligibility objective metrics with the subjective scores by shallow feed forward neural network for time–frequency masking speech separation algorithms

Applied Acoustics ◽

10.1016/j.apacoust.2021.108539 ◽

2022 ◽

Vol 188 ◽

pp. 108539

Author(s):

Sania Gul ◽

Muhammad Salman Khan ◽

Néstor Becerra Yoma ◽

Syed Waqar Shah ◽

Sheheryar

Keyword(s):

Neural Network ◽

Speech Separation ◽

Feed Forward Neural Network ◽

Feed Forward ◽

Time Frequency ◽

Objective Metrics

Anchor voiceprint recognition in live streaming via RawNet-SA and gated recurrent unit

EURASIP Journal on Audio Speech and Music Processing ◽

10.1186/s13636-021-00234-3 ◽

2021 ◽

Vol 2021 (1) ◽

Author(s):

Jiacheng Yao ◽

Jing Zhang ◽

Jiafeng Li ◽

Li Zhuo

Keyword(s):

Video Streaming ◽

Feature Vector ◽

The Self ◽

Live Streaming ◽

Network Environment ◽

Speech Separation ◽

Attention Network ◽

Feature Sequence ◽

Gated Recurrent Unit ◽

Great Harm

AbstractWith the sharp booming of online live streaming platforms, some anchors seek profits and accumulate popularity by mixing inappropriate content into live programs. After being blacklisted, these anchors even forged their identities to change the platform to continue live, causing great harm to the network environment. Therefore, we propose an anchor voiceprint recognition in live streaming via RawNet-SA and gated recurrent unit (GRU) for anchor identification of live platform. First, the speech of the anchor is extracted from the live streaming by using voice activation detection (VAD) and speech separation. Then, the feature sequence of anchor voiceprint is generated from the speech waveform with the self-attention network RawNet-SA. Finally, the feature sequence of anchor voiceprint is aggregated by GRU to transform into a deep voiceprint feature vector for anchor recognition. Experiments are conducted on the VoxCeleb, CN-Celeb, and MUSAN dataset, and the competitive results demonstrate that our method can effectively recognize the anchor voiceprint in video streaming.

Speech Separation Based on DPTNet with Sparse Attention

10.1109/ic-nidc54101.2021.9660488 ◽

2021 ◽

Author(s):

Beom Jun Woo ◽

Hyung Yong Kim ◽

Jeunghun Kim ◽

Nam Soo Kim

Keyword(s):

Speech Separation

Parameter Estimation of Source Image Using Least-Squares for Dual Channel Underdetermined Convolutive Blind Speech Separation

Journal of the Korea Academia-Industrial cooperation Society ◽

10.5762/kais.2021.22.10.544 ◽

2021 ◽

Vol 22 (10) ◽

pp. 544-551

Author(s):

Jounghoon Beh

Keyword(s):

Parameter Estimation ◽

Least Squares ◽

Source Image ◽

Speech Separation ◽

Dual Channel

Deep Variational Generative Models for Audio-Visual Speech Separation

10.1109/mlsp52302.2021.9596406 ◽

2021 ◽

Author(s):

Viet-Nhat Nguyen ◽

Mostafa Sadeghi ◽

Elisa Ricci ◽

Xavier Alameda-Pineda

Keyword(s):

Generative Models ◽

Visual Speech ◽

Speech Separation

Convolutive Prediction for Reverberant Speech Separation

10.1109/waspaa52581.2021.9632667 ◽

2021 ◽

Author(s):

Zhong-Qiu Wang ◽

Gordon Wichern ◽

Jonathan Le Roux

Keyword(s):

Speech Separation ◽

Reverberant Speech

DPTCN-ATPP: Multi-scale End-to-end Modeling for Single-channel Speech Separation

10.1109/iccis53528.2021.9645957 ◽

2021 ◽

Author(s):

Yanmin Zhu ◽

Xiang Zheng ◽

Xinrong Wu ◽

Wanning Liu ◽

Lei Pi ◽

...

Keyword(s):

Single Channel ◽

Speech Separation ◽

Multi Scale ◽

End To End

speech separation
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Exploring single channel speech separation for short-time text-dependent speaker verification

Role of Speech Separation in Verifying the Speaker Under Degraded Conditions Using EMD and Hilbert Transform

Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain

Enhancing the correlation between the quality and intelligibility objective metrics with the subjective scores by shallow feed forward neural network for time–frequency masking speech separation algorithms

Anchor voiceprint recognition in live streaming via RawNet-SA and gated recurrent unit

Speech Separation Based on DPTNet with Sparse Attention

Parameter Estimation of Source Image Using Least-Squares for Dual Channel Underdetermined Convolutive Blind Speech Separation

Deep Variational Generative Models for Audio-Visual Speech Separation

Convolutive Prediction for Reverberant Speech Separation

DPTCN-ATPP: Multi-scale End-to-end Modeling for Single-channel Speech Separation

Export Citation Format

speech separationRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Exploring single channel speech separation for short-time text-dependent speaker verification

Role of Speech Separation in Verifying the Speaker Under Degraded Conditions Using EMD and Hilbert Transform

Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-order Latent Domain

Enhancing the correlation between the quality and intelligibility objective metrics with the subjective scores by shallow feed forward neural network for time–frequency masking speech separation algorithms

Anchor voiceprint recognition in live streaming via RawNet-SA and gated recurrent unit

Speech Separation Based on DPTNet with Sparse Attention

Parameter Estimation of Source Image Using Least-Squares for Dual Channel Underdetermined Convolutive Blind Speech Separation

Deep Variational Generative Models for Audio-Visual Speech Separation

Convolutive Prediction for Reverberant Speech Separation

DPTCN-ATPP: Multi-scale End-to-end Modeling for Single-channel Speech Separation

speech separation
Recently Published Documents