Impact of phase distortion and phase-insensitive speech enhancement on speech quality perceived by hearing-impaired listeners

2020 ◽  
Vol 148 (4) ◽  
pp. 2650-2650
Author(s):  
Zhuohuang Zhang ◽  
Donald S. Williamson ◽  
Yi Shen
Sensors ◽  
2021 ◽  
Vol 21 (5) ◽  
pp. 1878
Author(s):  
Yi Zhou ◽  
Haiping Wang ◽  
Yijing Chu ◽  
Hongqing Liu

The use of multiple spatially distributed microphones allows performing spatial filtering along with conventional temporal filtering, which can better reject the interference signals, leading to an overall improvement of the speech quality. In this paper, we propose a novel dual-microphone generalized sidelobe canceller (GSC) algorithm assisted by a bone-conduction (BC) sensor for speech enhancement, which is named BC-assisted GSC (BCA-GSC) algorithm. The BC sensor is relatively insensitive to the ambient noise compared to the conventional air-conduction (AC) microphone. Hence, BC speech can be analyzed to generate very accurate voice activity detection (VAD), even in a high noise environment. The proposed algorithm incorporates the VAD information obtained by the BC speech into the adaptive blocking matrix (ABM) and adaptive noise canceller (ANC) in GSC. By using VAD to control ABM and combining VAD with signal-to-interference ratio (SIR) to control ANC, the proposed method could suppress interferences and improve the overall performance of GSC significantly. It is verified by experiments that the proposed GSC system not only improves speech quality remarkably but also boosts speech intelligibility.


2021 ◽  
pp. 2150022
Author(s):  
Caio Cesar Enside de Abreu ◽  
Marco Aparecido Queiroz Duarte ◽  
Bruno Rodrigues de Oliveira ◽  
Jozue Vieira Filho ◽  
Francisco Villarreal

Speech processing systems are very important in different applications involving speech and voice quality such as automatic speech recognition, forensic phonetics and speech enhancement, among others. In most of them, the acoustic environmental noise is added to the original signal, decreasing the signal-to-noise ratio (SNR) and the speech quality by consequence. Therefore, estimating noise is one of the most important steps in speech processing whether to reduce it before processing or to design robust algorithms. In this paper, a new approach to estimate noise from speech signals is presented and its effectiveness is tested in the speech enhancement context. For this purpose, partial least squares (PLS) regression is used to model the acoustic environment (AE) and a Wiener filter based on a priori SNR estimation is implemented to evaluate the proposed approach. Six noise types are used to create seven acoustically modeled noises. The basic idea is to consider the AE model to identify the noise type and estimate its power to be used in a speech processing system. Speech signals processed using the proposed method and classical noise estimators are evaluated through objective measures. Results show that the proposed method produces better speech quality than state-of-the-art noise estimators, enabling it to be used in real-time applications in the field of robotic, telecommunications and acoustic analysis.


2007 ◽  
Vol 122 (2) ◽  
pp. 1150-1164 ◽  
Author(s):  
Kathryn H. Arehart ◽  
James M. Kates ◽  
Melinda C. Anderson ◽  
Lewis O. Harvey

2002 ◽  
Vol 111 (5) ◽  
pp. 2427
Author(s):  
Harikrishna P. Natarajan ◽  
Ashok K. Krishnamurthy ◽  
Lawrence L. Feth

2020 ◽  
pp. 2150017
Author(s):  
Bittu Kumar

In this paper, the performance of compressive sensing (CS)-based technique for speech enhancement has been studied and results analyzed with recovery algorithms as a comparison of their performances. This is done for several recovery algorithms such as matching pursuit, orthogonal matching pursuit, stage-wise orthogonal matching pursuit, compressive sampling matching pursuit and generalized orthogonal matching pursuit. Performances of all these greedy algorithms were compared for speech enhancement. The evaluation of results has been carried out using objective measures (perceptual evaluation of speech quality, log-likelihood ratio, weighted spectral slope distance and segmental signal-to-noise ratio), simulation time and composite objective measures (signal distortion C[Formula: see text], background intrusiveness C[Formula: see text] and overall quality C[Formula: see text]. Results showed that the CS-based technique using generalized orthogonal matching pursuit algorithm yields better performance than the other recovery algorithms in terms of speech quality and distortion.


2020 ◽  
Vol 10 (17) ◽  
pp. 6077
Author(s):  
Gyuseok Park ◽  
Woohyeong Cho ◽  
Kyu-Sung Kim ◽  
Sangmin Lee

Hearing aids are small electronic devices designed to improve hearing for persons with impaired hearing, using sophisticated audio signal processing algorithms and technologies. In general, the speech enhancement algorithms in hearing aids remove the environmental noise and enhance speech while still giving consideration to hearing characteristics and the environmental surroundings. In this study, a speech enhancement algorithm was proposed to improve speech quality in a hearing aid environment by applying noise reduction algorithms with deep neural network learning based on noise classification. In order to evaluate the speech enhancement in an actual hearing aid environment, ten types of noise were self-recorded and classified using convolutional neural networks. In addition, noise reduction for speech enhancement in the hearing aid were applied by deep neural networks based on the noise classification. As a result, the speech quality based on the speech enhancements removed using the deep neural networks—and associated environmental noise classification—exhibited a significant improvement over that of the conventional hearing aid algorithm. The improved speech quality was also evaluated by objective measure through the perceptual evaluation of speech quality score, the short-time objective intelligibility score, the overall quality composite measure, and the log likelihood ratio score.


Sign in / Sign up

Export Citation Format

Share Document