A Cross-Entropy-Guided (CEG) Measure for Speech Enhancement Front-End Assessing Performances of Back-End Automatic Speech Recognition

Robust Front-End for Multi-Channel ASR using Flow-Based Density Estimation

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/518 ◽

2020 ◽

Author(s):

Hyeongju Kim ◽

Hyeonseung Lee ◽

Woo Hyun Kang ◽

Hyung Yong Kim ◽

Nam Soo Kim

Keyword(s):

Speech Recognition ◽

Density Estimation ◽

Speech Enhancement ◽

Automatic Speech Recognition ◽

Optimization Techniques ◽

Data Simulation ◽

Noisy Speech ◽

Front End ◽

Novel Approach ◽

Parallel Data

For multi-channel speech recognition, speech enhancement techniques such as denoising or dereverberation are conventionally applied as a front-end processor. Deep learning-based front-ends using such techniques require aligned clean and noisy speech pairs which are generally obtained via data simulation. Recently, several joint optimization techniques have been proposed to train the front-end without parallel data within an end-to-end automatic speech recognition (ASR) scheme. However, the ASR objective is sub-optimal and insufficient for fully training the front-end, which still leaves room for improvement. In this paper, we propose a novel approach which incorporates flow-based density estimation for the robust front-end using non-parallel clean and noisy speech. Experimental results on the CHiME-4 dataset show that the proposed method outperforms the conventional techniques where the front-end is trained only with ASR objective.

Download Full-text

Dual Application of Speech Enhancement for Automatic Speech Recognition

2021 IEEE Spoken Language Technology Workshop (SLT) ◽

10.1109/slt48900.2021.9383624 ◽

2021 ◽

Author(s):

Ashutosh Pandey ◽

Chunxi Liu ◽

Yun Wang ◽

Yatharth Saraf

Keyword(s):

Speech Recognition ◽

Speech Enhancement ◽

Automatic Speech Recognition

Download Full-text

Comparing Front-End Enhancement Techniques and Multiconditioned Training for Robust Automatic Speech Recognition

Text, Speech, and Dialogue - Lecture Notes in Computer Science ◽

10.1007/978-3-030-27947-9_28 ◽

2019 ◽

pp. 329-340 ◽

Cited By ~ 1

Author(s):

Meet H. Soni ◽

Sonal Joshi ◽

Ashish Panda

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Front End

Download Full-text

Speech Enhancement System for Automatic Speech Recognition in Automotive Environment

10.1109/icccnt51525.2021.9579986 ◽

2021 ◽

Author(s):

Gokul G. Nair ◽

C. Santhosh Kumar

Keyword(s):

Speech Recognition ◽

Speech Enhancement ◽

Automatic Speech Recognition

Download Full-text

A Unified Front-end Anti-interference Approach for Robust Automatic Speech Recognition

2019 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) ◽

10.1109/isspit47144.2019.9001809 ◽

2019 ◽

Author(s):

Yunming Liang ◽

Yi Zhou ◽

Yongbao Ma ◽

Hongqing Liu

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Front End

Download Full-text

An MTF‐based blind restoration of temporal power envelopes as a front‐end processor for automatic speech recognition systems in reverberant environments

The Journal of the Acoustical Society of America ◽

10.1121/1.2933278 ◽

2008 ◽

Vol 123 (5) ◽

pp. 3180-3180

Author(s):

Xugang Lu ◽

Masashi Unoki ◽

Masato Akagi

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Front End ◽

Reverberant Environments ◽

Blind Restoration ◽

Recognition Systems

Download Full-text

Auditory-Inspired Morphological Processing of Speech Spectrograms: Applications in Automatic Speech Recognition and Speech Enhancement

Cognitive Computation ◽

10.1007/s12559-012-9196-6 ◽

2012 ◽

Vol 5 (4) ◽

pp. 426-441 ◽

Cited By ~ 7

Author(s):

Joyner Cadore ◽

Francisco J. Valverde-Albacete ◽

Ascensión Gallardo-Antolín ◽

Carmen Peláez-Moreno

Keyword(s):

Speech Recognition ◽

Speech Enhancement ◽

Automatic Speech Recognition ◽

Morphological Processing

Download Full-text

Towards an Intelligent Acoustic Front End for Automatic Speech Recognition: Built-in Speaker Normalization

EURASIP Journal on Audio Speech and Music Processing ◽

10.1155/2008/148967 ◽

2008 ◽

Vol 2008 ◽

pp. 1-13 ◽

Cited By ~ 2

Author(s):

Umit H. Yapanel ◽

John H. L. Hansen

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Speaker Normalization ◽

Front End

Download Full-text

Coupled Dictionaries for Exemplar-Based Speech Enhancement and Automatic Speech Recognition

IEEE/ACM Transactions on Audio Speech and Language Processing ◽

10.1109/taslp.2015.2450491 ◽

2015 ◽

Vol 23 (11) ◽

pp. 1788-1799 ◽

Cited By ~ 13

Author(s):

Deepak Baby ◽

Tuomas Virtanen ◽

Jort F. Gemmeke ◽

Hugo Van hamme

Keyword(s):

Speech Recognition ◽

Speech Enhancement ◽

Automatic Speech Recognition

Download Full-text

A Companding Front End for Noise-Robust Automatic Speech Recognition

Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005. ◽

10.1109/icassp.2005.1415097 ◽

2006 ◽

Cited By ~ 1

Author(s):

J. Guinness ◽

B. Raj ◽

B. Schmidt-Nielsen ◽

L. Turicchia ◽

R. Sarpeshkar

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Front End ◽

Noise Robust

Download Full-text