Noisy speech enhancement based on an adaptive threshold and a modified hard thresholding function in wavelet packet domain

Speech enhancement is very important for mobile communications or some other applications in car. The energy distribution of signal is the basis of algorithms which denoise noisy speech in time-frequency domain. In this work, the noise regarded is the tire-road noise when driving in expressway. Wavelet packets transform is used in the analysis. After decomposing noise signal and noisy speech signal by wavelet packet transform, the analysis for the difference of the energy distribution between noisy speech and noise is finished.

Download Full-text

An approach for noisy speech enhancement by employing the teager energy operation on wavelet packet

TENCON 2011 - 2011 IEEE Region 10 Conference ◽

10.1109/tencon.2011.6129289 ◽

2011 ◽

Author(s):

Tahsina Farah Sanam ◽

Celia Shahnaz

Keyword(s):

Speech Enhancement ◽

Wavelet Packet ◽

Noisy Speech ◽

Teager Energy

Download Full-text

Speech Enhancement Based on Student $t$ Modeling of Teager Energy Operated Perceptual Wavelet Packet Coefficients and a Custom Thresholding Function

IEEE/ACM Transactions on Audio Speech and Language Processing ◽

10.1109/taslp.2015.2443983 ◽

2015 ◽

Vol 23 (11) ◽

pp. 1800-1811 ◽

Cited By ~ 15

Author(s):

Md Tauhidul Islam ◽

Celia Shahnaz ◽

Wei-Ping Zhu ◽

M. Omair Ahmad

Keyword(s):

Speech Enhancement ◽

Wavelet Packet ◽

Teager Energy ◽

Thresholding Function

Download Full-text

A Non-Contact Speech Enhancement Algorithm Based on Wavelet Packet Adaptive Threshold

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.241-244.194 ◽

2012 ◽

Vol 241-244 ◽

pp. 194-198

Author(s):

Hui Jun Xue ◽

Sheng Li ◽

Teng Jiao ◽

Guo Hua Lu ◽

Yang Zhang ◽

...

Keyword(s):

Speech Enhancement ◽

Speech Signal ◽

Wavelet Packet ◽

Wiener Filter ◽

Adaptive Threshold ◽

Human Communication ◽

Frequency Signal ◽

Time Frequency ◽

Detecting Method ◽

Enhancement Algorithm

Speech is an important method for human communication. In this paper, we developed a new method for detecting speech signal. Because of the advantage of this speech detecting method, it has great potential application value in many fields. Simultaneously, basing on the good capability of wavelet packet for analyzing time-frequency signal, this paper also developed an algorithm of wavelet packet threshold by using hard threshold and soft threshold for removing noise. Comparing to spectral subtraction and Wiener filter speech enhancement algorithm, the proposed algorithm takes on a better performance on noise removing and speech signal reserving.

Download Full-text

Speech enhancement using adaptive thresholding based on gamma distribution of Teager energy operated intrinsic mode functions

TURKISH JOURNAL OF ELECTRICAL ENGINEERING & COMPUTER SCIENCES ◽

10.3906/elk-1804-18 ◽

2019 ◽

pp. 1355-1370

Author(s):

ÖZKAN ARSLAN ◽

ERKAN ZEKİ ENGİN

Keyword(s):

Speech Enhancement ◽

Adaptive Threshold ◽

Intrinsic Mode Functions ◽

Wavelet Shrinkage ◽

Noisy Speech ◽

Perceptual Evaluation ◽

Shrinkage Methods ◽

Speech Database ◽

Leibler Divergence ◽

Teager Energy

This paper introduces a new speech enhancement algorithm based on the adaptive threshold of intrinsic mode functions (IMFs) of noisy signal frames extracted by empirical mode decomposition. Adaptive threshold values are estimated by using the gamma statistical model of Teager energy operated IMFs of noisy speech and estimated noise based on symmetric Kullback–Leibler divergence. The enhanced speech signal is obtained by a semisoft thresholding function, which is utilized by threshold IMF coefficients of noisy speech. The method is tested on the NOIZEUS speech database and the proposed method is compared with wavelet-shrinkage and EMD-shrinkage methods in terms of segmental SNR improvement (SegSNR), weighted spectral slope (WSS), and perceptual evaluation of speech quality (PESQ). Experimental results show that the proposed method provides a higher SegSNR improvement in dB, lower WSS distance, and higher PESQ scores than wavelet-shrinkage and EMD-shrinkage methods. The proposed method shows better performance than traditional threshold-based speech enhancement approaches from high to low SNR levels.

Download Full-text

Robustness and Sensitivity Tuning of the Kalman Filter for Speech Enhancement

Signals ◽

10.3390/signals2030027 ◽

2021 ◽

Vol 2 (3) ◽

pp. 434-455

Author(s):

Sujan Kumar Roy ◽

Kuldip K. Paliwal

Keyword(s):

Kalman Filter ◽

Speech Enhancement ◽

Linear Prediction ◽

Real Life ◽

Model Parameters ◽

Noise Variance ◽

Noisy Speech ◽

Kalman Gain ◽

Whitening Filter ◽

Prediction Coefficient

Inaccurate estimates of the linear prediction coefficient (LPC) and noise variance introduce bias in Kalman filter (KF) gain and degrade speech enhancement performance. The existing methods propose a tuning of the biased Kalman gain, particularly in stationary noise conditions. This paper introduces a tuning of the KF gain for speech enhancement in real-life noise conditions. First, we estimate noise from each noisy speech frame using a speech presence probability (SPP) method to compute the noise variance. Then, we construct a whitening filter (with its coefficients computed from the estimated noise) to pre-whiten each noisy speech frame prior to computing the speech LPC parameters. We then construct the KF with the estimated parameters, where the robustness metric offsets the bias in KF gain during speech absence of noisy speech to that of the sensitivity metric during speech presence to achieve better noise reduction. The noise variance and the speech model parameters are adopted as a speech activity detector. The reduced-biased Kalman gain enables the KF to minimize the noise effect significantly, yielding the enhanced speech. Objective and subjective scores on the NOIZEUS corpus demonstrate that the enhanced speech produced by the proposed method exhibits higher quality and intelligibility than some benchmark methods.

Download Full-text

Weighted codebook mapping for noisy speech enhancement using harmonic-noise model

10.21437/interspeech.2006-70 ◽

2006 ◽

Author(s):

Esfandiar Zavarehei ◽

Saeed Vaseghi ◽

Qin Yan

Keyword(s):

Speech Enhancement ◽

Noise Model ◽

Noisy Speech ◽

Harmonic Noise

Download Full-text

Speech Enhancement Using Hilbert Spectrum and Wavelet Packet Based Soft-Thresholding

Science Journal of Circuits Systems and Signal Processing ◽

10.11648/j.cssp.20150401.12 ◽

2015 ◽

Vol 4 (1) ◽

pp. 1

Author(s):

Md. Ekramul Hamid

Keyword(s):

Speech Enhancement ◽

Wavelet Packet ◽

Hilbert Spectrum ◽

Soft Thresholding

Download Full-text

An architecture for wavelet-packet based speech enhancement for hearing aids

2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100) ◽

10.1109/icassp.2000.859093 ◽

2002 ◽

Cited By ~ 2

Author(s):

M.A. Trenas ◽

J. Lopez ◽

E.L. Zapata ◽

F. Arguello

Keyword(s):

Speech Enhancement ◽

Hearing Aids ◽

Wavelet Packet

Download Full-text

Speech Enhancement Using Neuro-Fuzzy Classifier

Advances in Data Mining and Database Management - Handbook of Research on Automated Feature Engineering and Advanced Applications in Data Science ◽

10.4018/978-1-7998-6659-6.ch009 ◽

2021 ◽

pp. 164-181

Author(s):

Judith Justin ◽

Vanithamani R.

Keyword(s):

Feature Extraction ◽

Speech Enhancement ◽

The Other ◽

Objective Measures ◽

Noise Levels ◽

Fuzzy Classifier ◽

Noisy Speech ◽

Enhancement Technique ◽

Time Frequency ◽

Neuro Fuzzy

In this chapter, a speech enhancement technique is implemented using a neuro-fuzzy classifier. Noisy speech sentences from NOIZEUS and AURORA databases are taken for the study. Feature extraction is implemented through modifications in amplitude magnitude spectrograms. A four class neuro-fuzzy classifier splits the noisy speech samples into noise-only part, signal only part, more noise-less signal part, and more signal-less noise part of the time-frequency units. Appropriate weights are applied in the enhancement phase. The enhanced speech sentence is evaluated using objective measures. An analysis of the performance of the Neuro-Fuzzy 4 (NF 4) classifier is done. A comparison of the performance of the classifier with other conventional techniques is done for various noises at different noise levels. It is observed that the numerical values of the measures obtained are better when compared to the others. An overall comparison of the performance of the NF 4 classifier is done and it is inferred that NF4 outperforms the other techniques in speech enhancement.

Download Full-text