Incoherent Discriminative Dictionary Learning for Speech Enhancement

Speech enhancement is one of the many challenging tasks in signal processing, especially in the case of nonstationary speech-like noise. In this paper a new incoherent discriminative dictionary learning algorithm is proposed to model both speech and noise, where the cost function accounts for both “source confusion” and “source distortion” errors, with a regularization term that penalizes the coherence between speech and noise sub-dictionaries. At the enhancement stage, we use sparse coding on the learnt dictionary to ﬁnd an estimate for both clean speech and noise amplitude spectrum. In the ﬁnal phase, the Wiener ﬁlter is used to reﬁne the clean speech estimate. Experiments on the Noizeus dataset, using two objective speech enhancement measures: frequency-weighted segmental SNR and Perceptual Evaluation of Speech Quality (PESQ) demonstrate that the proposed algorithm outperforms other speech enhancement methods tested.

Download Full-text

Dual-Mic Speech Enhancement Based on TF-GSC with Leakage Suppression and Signal Recovery

Applied Sciences ◽

10.3390/app11062816 ◽

2021 ◽

Vol 11 (6) ◽

pp. 2816

Author(s):

Hansol Kim ◽

Jong Won Shin

Keyword(s):

Speech Enhancement ◽

Wiener Filter ◽

Signal Recovery ◽

Gain Function ◽

Microphone Signal ◽

Perceptual Evaluation ◽

Blocking Matrix ◽

Adaptive Noise ◽

Adaptive Noise Canceller ◽

Sidelobe Canceller

The transfer function-generalized sidelobe canceller (TF-GSC) is one of the most popular structures for the adaptive beamformer used in multi-channel speech enhancement. Although the TF-GSC has shown decent performance, a certain amount of steering error is inevitable, which causes leakage of speech components through the blocking matrix (BM) and distortion in the fixed beamformer (FBF) output. In this paper, we propose to suppress the leaked signal in the output of the BM and restore the desired signal in the FBF output of the TF-GSC. To reduce the risk of attenuating speech in the adaptive noise canceller (ANC), the speech component in the output of the BM is suppressed by applying a gain function similar to the square-root Wiener filter, assuming that a certain portion of the desired speech should be leaked into the BM output. Additionally, we propose to restore the attenuated desired signal in the FBF output by adding some of the microphone signal components back, depending on how microphone signals are related to the FBF and BM outputs. The experimental results showed that the proposed TF-GSC outperformed conventional TF-GSC in terms of the perceptual evaluation of speech quality (PESQ) scores under various noise conditions and the direction of arrivals for the desired and interfering sources.

Download Full-text

Discriminative dictionary learning algorithm with pairwise local constraints for histopathological image classification

Medical & Biological Engineering & Computing ◽

10.1007/s11517-020-02281-y ◽

2021 ◽

Author(s):

Hongzhong Tang ◽

Lizhen Mao ◽

Shuying Zeng ◽

Shijun Deng ◽

Zhaoyang Ai

Keyword(s):

Image Classification ◽

Dictionary Learning ◽

Learning Algorithm ◽

Local Constraints ◽

Histopathological Image ◽

Histopathological Image Classification ◽

Discriminative Dictionary Learning

Download Full-text

Single Channel Speech Enhancement using Wiener Filter and Compressive Sensing

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v7i4.pp1941-1951 ◽

2017 ◽

Vol 7 (4) ◽

pp. 1941

Author(s):

Amart Sulong ◽

Teddy Surya Gunawan ◽

Othman O Khalifa ◽

Mira Kartiwi ◽

Hassan Dao

Keyword(s):

Compressive Sensing ◽

Noise Reduction ◽

Speech Enhancement ◽

Single Channel ◽

Objective Assessment ◽

Amplitude Spectrum ◽

Wiener Filter ◽

Speech Quality ◽

Interactive Effects ◽

Assessment Tests

<table width="593" border="0" cellspacing="0" cellpadding="0"><tbody><tr><td valign="top" width="387"><p class="Text">The speech enhancement algorithms are utilized to overcome multiple limitation factors in recent applications such as mobile phone and communication channel. The challenges focus on corrupted speech solution between noise reduction and signal distortion. We used a modified Wiener filter and compressive sensing (CS) to investigate and evaluate the improvement of speech quality. This new method adapted noise estimation and Wiener filter gain function in which to increase weight amplitude spectrum and improve mitigation of interested signals. The CS is then applied using the gradient projection for sparse reconstruction (GPSR) technique as a study system to empirically investigate the interactive effects of the corrupted noise and obtain better perceptual improvement aspects to listener fatigue with noiseless reduction conditions. The proposed algorithm shows an enhancement in testing performance evaluation of objective assessment tests outperform compared to other conventional algorithms at various noise type conditions of 0, 5, 10, 15 dB SNRs. Therefore, the proposed algorithm significantly achieved the speech quality improvement and efficiently obtained higher performance resulting in better noise reduction compare to other conventional algorithms. </p></td></tr></tbody></table>

Download Full-text

An interactively constrained discriminative dictionary learning algorithm for image classification

Engineering Applications of Artificial Intelligence ◽

10.1016/j.engappai.2018.04.006 ◽

2018 ◽

Vol 72 ◽

pp. 241-252 ◽

Cited By ~ 6

Author(s):

Zhengming Li ◽

Zheng Zhang ◽

Zizhu Fan ◽

Jie Wen

Keyword(s):

Image Classification ◽

Dictionary Learning ◽

Learning Algorithm ◽

Discriminative Dictionary Learning

Download Full-text

Discriminative dictionary learning algorithm based on sample diversity and locality of atoms for face recognition

Journal of Visual Communication and Image Representation ◽

10.1016/j.jvcir.2020.102763 ◽

2020 ◽

Vol 71 ◽

pp. 102763 ◽

Cited By ~ 3

Author(s):

Shigang Liu ◽

Yuhong Wang ◽

Xiaosheng Wu ◽

Jun Li ◽

Tao Lei

Keyword(s):

Face Recognition ◽

Dictionary Learning ◽

Learning Algorithm ◽

Discriminative Dictionary Learning

Download Full-text

Sparsity Based Locality-Sensitive Discriminative Dictionary Learning for Video Semantic Analysis

Mathematical Problems in Engineering ◽

10.1155/2018/9312563 ◽

2018 ◽

Vol 2018 ◽

pp. 1-11 ◽

Cited By ~ 1

Author(s):

Ben-Bright Benuwa ◽

Yongzhao Zhan ◽

Benjamin Ghansah ◽

Ernest K. Ansah ◽

Andriana Sarkodie

Keyword(s):

Dictionary Learning ◽

Semantic Analysis ◽

Learning Algorithm ◽

Recognition Rate ◽

Image Data ◽

Classification Performance ◽

Video Data ◽

Video Feature ◽

Video Semantic Analysis ◽

Discriminative Dictionary Learning

Dictionary learning (DL) and sparse representation (SR) based classifiers have greatly impacted the classification performance and have had good recognition rate on image data. In video semantic analysis (VSA), the local structure of video data contains more vital discriminative information needed for classification. However, this has not been fully exploited by the current DL based approaches. Besides, similar coding findings are not being realized from video features with the same video category. Based on the issues stated afore, a novel learning algorithm, called sparsity based locality-sensitive discriminative dictionary learning (SLSDDL) for VSA is proposed in this paper. In the proposed algorithm, a discriminant loss function for the category based on sparse coding of the sparse coefficients is introduced into structure of locality-sensitive dictionary learning (LSDL) algorithm. Finally, the sparse coefficients for the testing video feature sample are solved by the optimized method of SLSDDL and the classification result for video semantic is obtained by minimizing the error between the original and reconstructed samples. The experiment results show that the proposed SLSDDL significantly improves the performance of video semantic detection compared with the comparative state-of-the-art approaches. Moreover, the robustness to various diverse environments in video is also demonstrated, which proves the universality of the novel approach.

Download Full-text

A test sample oriented two-phase discriminative dictionary learning algorithm for face recognition

Intelligent Data Analysis ◽

10.3233/ida-150296 ◽

2016 ◽

Vol 20 (6) ◽

pp. 1405-1423 ◽

Cited By ~ 1

Author(s):

Zhengming Li ◽

Qi Zhu ◽

Yan Chen

Keyword(s):

Face Recognition ◽

Dictionary Learning ◽

Learning Algorithm ◽

Test Sample ◽

Two Phase ◽

Discriminative Dictionary Learning

Download Full-text

A novel single channel speech enhancement approach by combining Wiener filter and dictionary learning

2013 IEEE International Conference on Acoustics, Speech and Signal Processing ◽

10.1109/icassp.2013.6639355 ◽

2013 ◽

Cited By ~ 1

Author(s):

Hung-Wei Tseng ◽

Srikanth Vishnubhotla ◽

Mingyi Hong ◽

Jinjun Xiao ◽

Zhi-Quan Luo ◽

...

Keyword(s):

Speech Enhancement ◽

Dictionary Learning ◽

Single Channel ◽

Wiener Filter

Download Full-text

A Weighted Block Dictionary Learning Algorithm for Classification

Mathematical Problems in Engineering ◽

10.1155/2016/3824027 ◽

2016 ◽

Vol 2016 ◽

pp. 1-15 ◽

Cited By ~ 3

Author(s):

Zhongrong Shi

Keyword(s):

Dictionary Learning ◽

Learning Algorithm ◽

State Of The Art ◽

Critical Role ◽

Learning Method ◽

Discriminative Power ◽

Single Class ◽

Weight Value ◽

Label Information ◽

Discriminative Dictionary Learning

Discriminative dictionary learning, playing a critical role in sparse representation based classification, has led to state-of-the-art classification results. Among the existing discriminative dictionary learning methods, two different approaches, shared dictionary and class-specific dictionary, which associate each dictionary atom to all classes or a single class, have been studied. The shared dictionary is a compact method but with lack of discriminative information; the class-specific dictionary contains discriminative information but consists of redundant atoms among different class dictionaries. To combine the advantages of both methods, we propose a new weighted block dictionary learning method. This method introduces proto dictionary and class dictionary. The proto dictionary is a base dictionary without label information. The class dictionary is a class-specific dictionary, which is a weighted proto dictionary. The weight value indicates the contribution of each proto dictionary block when constructing a class dictionary. These weight values can be computed conveniently as they are designed to adapt sparse coefficients. Different class dictionaries have different weight vectors but share the same proto dictionary, which results in higher discriminative power and lower redundancy. Experimental results demonstrate that the proposed algorithm has better classification results compared with several dictionary learning algorithms.

Download Full-text

Noise learning based discriminative dictionary learning algorithm for image classification

Journal of the Franklin Institute ◽

10.1016/j.jfranklin.2020.01.007 ◽

2020 ◽

Vol 357 (4) ◽

pp. 2492-2513 ◽

Cited By ~ 1

Author(s):

Tian Zhou ◽

Yunyi Li ◽

Guan Gui

Keyword(s):

Image Classification ◽

Dictionary Learning ◽

Learning Algorithm ◽

Discriminative Dictionary Learning

Download Full-text