speech enhancement Latest Research Papers

Glance and gaze: A collaborative learning framework for single-channel speech enhancement

Applied Acoustics ◽

10.1016/j.apacoust.2021.108499 ◽

2022 ◽

Vol 187 ◽

pp. 108499

Author(s):

Andong Li ◽

Chengshi Zheng ◽

Lu Zhang ◽

Xiaodong Li

Keyword(s):

Collaborative Learning ◽

Speech Enhancement ◽

Single Channel ◽

Learning Framework

Download Full-text

Time-Domain Joint Training Strategies of Speech Enhancement and Intent Classification Neural Models

Sensors ◽

10.3390/s22010374 ◽

2022 ◽

Vol 22 (1) ◽

pp. 374

Author(s):

Mohamed Nabih Ali ◽

Daniele Falavigna ◽

Alessio Brutti

Keyword(s):

Speech Enhancement ◽

Time Domain ◽

Background Noise ◽

State Of The Art ◽

Noisy Environments ◽

Neural Models ◽

Convolutional Network ◽

Environmental Perturbations ◽

Front End ◽

The Time Domain

Robustness against background noise and reverberation is essential for many real-world speech-based applications. One way to achieve this robustness is to employ a speech enhancement front-end that, independently of the back-end, removes the environmental perturbations from the target speech signal. However, although the enhancement front-end typically increases the speech quality from an intelligibility perspective, it tends to introduce distortions which deteriorate the performance of subsequent processing modules. In this paper, we investigate strategies for jointly training neural models for both speech enhancement and the back-end, which optimize a combined loss function. In this way, the enhancement front-end is guided by the back-end to provide more effective enhancement. Differently from typical state-of-the-art approaches employing on spectral features or neural embeddings, we operate in the time domain, processing raw waveforms in both components. As application scenario we consider intent classification in noisy environments. In particular, the front-end speech enhancement module is based on Wave-U-Net while the intent classifier is implemented as a temporal convolutional network. Exhaustive experiments are reported on versions of the Fluent Speech Commands corpus contaminated with noises from the Microsoft Scalable Noisy Speech Dataset, shedding light and providing insight about the most promising training approaches.

Download Full-text

A two-stage deep neuroevolutionary technique for self-adaptive speech enhancement

IEEE Access ◽

10.1109/access.2022.3140901 ◽

2022 ◽

pp. 1-1

Author(s):

Ryan LeBlanc ◽

Sid Ahmed Selouani

Keyword(s):

Speech Enhancement ◽

Two Stage ◽

Self Adaptive

Download Full-text

Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement

Trends in Hearing ◽

10.1177/23312165211068629 ◽

2022 ◽

Vol 26 ◽

pp. 233121652110686

Author(s):

Tim Green ◽

Gaston Hilkhuysen ◽

Mark Huckvale ◽

Stuart Rosen ◽

Mike Brookes ◽

...

Keyword(s):

Speech Enhancement ◽

Hearing Aids ◽

Spatial Information ◽

Signal To Noise Ratio ◽

Target Position ◽

Spatial Diversity ◽

Processing Scheme ◽

Time Frequency ◽

Sentence Recognition ◽

Enhancement Method

A signal processing approach combining beamforming with mask-informed speech enhancement was assessed by measuring sentence recognition in listeners with mild-to-moderate hearing impairment in adverse listening conditions that simulated the output of behind-the-ear hearing aids in a noisy classroom. Two types of beamforming were compared: binaural, with the two microphones of each aid treated as a single array, and bilateral, where independent left and right beamformers were derived. Binaural beamforming produces a narrower beam, maximising improvement in signal-to-noise ratio (SNR), but eliminates the spatial diversity that is preserved in bilateral beamforming. Each beamformer type was optimised for the true target position and implemented with and without additional speech enhancement in which spectral features extracted from the beamformer output were passed to a deep neural network trained to identify time-frequency regions dominated by target speech. Additional conditions comprising binaural beamforming combined with speech enhancement implemented using Wiener filtering or modulation-domain Kalman filtering were tested in normally-hearing (NH) listeners. Both beamformer types gave substantial improvements relative to no processing, with significantly greater benefit for binaural beamforming. Performance with additional mask-informed enhancement was poorer than with beamforming alone, for both beamformer types and both listener groups. In NH listeners the addition of mask-informed enhancement produced significantly poorer performance than both other forms of enhancement, neither of which differed from the beamformer alone. In summary, the additional improvement in SNR provided by binaural beamforming appeared to outweigh loss of spatial information, while speech understanding was not further improved by the mask-informed enhancement method implemented here.

Download Full-text

Speech Enhancement Using Nonlinear Kalman Filtering

Lecture Notes in Networks and Systems - Inventive Communication and Computational Technologies ◽

10.1007/978-981-16-5529-6_45 ◽

2022 ◽

pp. 575-585

Author(s):

T. Namratha ◽

B. Indra Kiran Reddy ◽

M. V. Deepak Chand Reddy ◽

P. Sudheesh

Keyword(s):

Speech Enhancement ◽

Kalman Filtering ◽

Nonlinear Kalman Filtering

Download Full-text

An Efficient Reference Free Adaptive Learning Process for Speech Enhancement Applications

Computers Materials & Continua ◽

10.32604/cmc.2022.020160 ◽

2022 ◽

Vol 70 (2) ◽

pp. 3067-3080

Author(s):

Girika Jyoshna ◽

Md. Zia Ur Rahman ◽

L. Koteswararao

Keyword(s):

Speech Enhancement ◽

Learning Process ◽

Adaptive Learning

Download Full-text

Robustness and sensitivity metrics-based tuning of the augmented Kalman filter for single-channel speech enhancement

Applied Acoustics ◽

10.1016/j.apacoust.2021.108355 ◽

2022 ◽

Vol 185 ◽

pp. 108355

Author(s):

Sujan Kumar Roy ◽

Kuldip K. Paliwal

Keyword(s):

Kalman Filter ◽

Speech Enhancement ◽

Single Channel

Download Full-text

A New Speech Enhancement Technique Based on Stationary Bionic Wavelet Transform and MMSE Estimate of Spectral Amplitude

Security and Communication Networks ◽

10.1155/2021/9968275 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Mourad Talbi ◽

Med Salim Bouhlel

Keyword(s):

Wavelet Transform ◽

Speech Enhancement ◽

Hearing Aids ◽

Speech Signal ◽

Minimum Mean Square Error ◽

Voice Conversion ◽

Spectral Amplitude ◽

Wavelet Coefficients ◽

Enhancement Technique ◽

Perceptual Evaluation

Speech enhancement has gained considerable attention in the employment of speech transmission via the communication channel, speaker identification, speech-based biometric systems, video conference, hearing aids, mobile phones, voice conversion, microphones, and so on. The background noise processing is needed for designing a successful speech enhancement system. In this work, a new speech enhancement technique based on Stationary Bionic Wavelet Transform (SBWT) and Minimum Mean Square Error (MMSE) Estimate of Spectral Amplitude is proposed. This technique consists at the first step in applying the SBWT to the noisy speech signal, in order to obtain eight noisy wavelet coefficients. The denoising of each of those coefficients is performed through the application of the denoising method based on MMSE Estimate of Spectral Amplitude. The SBWT inverse, S B W T − 1 , is applied to the obtained denoised stationary wavelet coefficients for finally obtaining the enhanced speech signal. The proposed technique’s performance is proved by the calculation of the Signal to Noise Ratio (SNR), the Segmental SNR (SSNR), and the Perceptual Evaluation of Speech Quality (PESQ).

Download Full-text

Comparative Study of Deep Learning Techniques Used for Speech Enhancement

10.1109/iccca52192.2021.9666413 ◽

2021 ◽

Author(s):

Ajay S ◽

Manisha R ◽

Pranav Maheshkumar Nivarthi ◽

Sai Harsha Nadendla ◽

C Santhosh Kumar

Keyword(s):

Deep Learning ◽

Comparative Study ◽

Speech Enhancement ◽

Learning Techniques

Download Full-text

Research on speech enhancement algorithm based on microphone array

10.1117/12.2625015 ◽

2021 ◽

Author(s):

BaoTong Wu ◽

Shun He

Keyword(s):

Speech Enhancement ◽

Microphone Array ◽

Enhancement Algorithm

Download Full-text

speech enhancement
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Glance and gaze: A collaborative learning framework for single-channel speech enhancement

Time-Domain Joint Training Strategies of Speech Enhancement and Intent Classification Neural Models

A two-stage deep neuroevolutionary technique for self-adaptive speech enhancement

Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement

Speech Enhancement Using Nonlinear Kalman Filtering

An Efficient Reference Free Adaptive Learning Process for Speech Enhancement Applications

Robustness and sensitivity metrics-based tuning of the augmented Kalman filter for single-channel speech enhancement

A New Speech Enhancement Technique Based on Stationary Bionic Wavelet Transform and MMSE Estimate of Spectral Amplitude

Comparative Study of Deep Learning Techniques Used for Speech Enhancement

Research on speech enhancement algorithm based on microphone array

Export Citation Format

speech enhancementRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Glance and gaze: A collaborative learning framework for single-channel speech enhancement

Time-Domain Joint Training Strategies of Speech Enhancement and Intent Classification Neural Models

A two-stage deep neuroevolutionary technique for self-adaptive speech enhancement

Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement

Speech Enhancement Using Nonlinear Kalman Filtering

An Efficient Reference Free Adaptive Learning Process for Speech Enhancement Applications

Robustness and sensitivity metrics-based tuning of the augmented Kalman filter for single-channel speech enhancement

A New Speech Enhancement Technique Based on Stationary Bionic Wavelet Transform and MMSE Estimate of Spectral Amplitude

Comparative Study of Deep Learning Techniques Used for Speech Enhancement

Research on speech enhancement algorithm based on microphone array

speech enhancement
Recently Published Documents