Dual and joint estimation for speech enhancement

Maintaining good speech quality and intelligibility is the primary constraint in mobile communications. The present work addresses speech enhancement in additive white and colored noise environments using a Kalman filter. Dual and joint estimation techniques were applied, and the quality of the enhanced speech was analyzed through the signal-to-noise ratio. The techniques were applied in both ideal and practical cases for two different speech samples.
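As a rough illustration of how enhancement by a Kalman filter can be scored with the signal-to-noise ratio, the sketch below filters a synthetic first-order autoregressive signal in white noise. It is not the dual or joint estimator of the abstract: those also estimate the model parameters, whereas here the coefficient `a` and the variances `q` and `r` are assumed known. All names and values are hypothetical.

```python
import math
import random


def snr_db(clean, estimate):
    """SNR in dB: clean-signal energy over the energy of the residual error."""
    signal_energy = sum(s * s for s in clean)
    error_energy = sum((s - e) ** 2 for s, e in zip(clean, estimate))
    return 10.0 * math.log10(signal_energy / error_energy)


def kalman_ar1(noisy, a, q, r):
    """Scalar Kalman filter for x[k] = a*x[k-1] + w[k], y[k] = x[k] + v[k].

    q and r are the (assumed known) variances of w and v.
    """
    x, p = 0.0, 1.0  # initial state estimate and its error variance
    estimates = []
    for y in noisy:
        # Predict the next state and its error variance.
        x = a * x
        p = a * a * p + q
        # Correct the prediction with the new observation.
        gain = p / (p + r)
        x = x + gain * (y - x)
        p = (1.0 - gain) * p
        estimates.append(x)
    return estimates


# Synthetic test signal (a stand-in for speech, which is an assumption).
rng = random.Random(0)
a, q, r = 0.95, 0.1, 1.0
clean, state = [], 0.0
for _ in range(2000):
    state = a * state + rng.gauss(0.0, math.sqrt(q))
    clean.append(state)
noisy = [s + rng.gauss(0.0, math.sqrt(r)) for s in clean]

enhanced = kalman_ar1(noisy, a, q, r)
snr_in = snr_db(clean, noisy)
snr_out = snr_db(clean, enhanced)
print(f"input SNR {snr_in:.2f} dB, output SNR {snr_out:.2f} dB")
```

Comparing `snr_out` with `snr_in` gives the SNR improvement; a real evaluation would use recorded speech and a higher-order speech model.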

Research on the Speech Enhancement Method based on PCA/KLT Algorithms

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.239-240.1274 ◽

2012 ◽

Vol 239-240 ◽

pp. 1274-1278

Author(s):

Guang Yan Wang ◽

Yan Xiang Geng ◽

Xiao Qun Zhao

Keyword(s):

Speech Enhancement ◽

High Performance ◽

Large Scale ◽

Colored Noise ◽

Minimum Description Length ◽

Signal To Noise Ratio ◽

Principal Component ◽

White And Colored Noise ◽

Karhunen Loeve Transform ◽

Subspace Selection

In this paper, we propose a subspace-based speech enhancement technique to reduce white or colored noise in strong background noise environments. The subspace approach is based on the Karhunen-Loève transform (KLT) and implemented via Principal Component Analysis (PCA). Subspace selection is provided by the minimum description length (MDL) criterion. An offset factor generated from the white noise is used to modify the variance to adapt to the specified colored noise. The objective speech quality measure SegSNR is used to evaluate the performance of the proposed method in the time domain. A large amount of data and figures show that the algorithm provides high performance over a wide range of input signal-to-noise ratios (-5 to 10 dB). The performance of the algorithm is assessed in white and colored noise.
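The segmental SNR used for time-domain evaluation can be sketched as the average of per-frame SNR values. The frame length and the clamping range of -10 to 35 dB below are common conventions assumed here, not values taken from the paper.

```python
import math


def seg_snr_db(clean, enhanced, frame_len=256, lo=-10.0, hi=35.0):
    """Segmental SNR: mean of per-frame SNRs (dB), each clamped to [lo, hi]."""
    frame_snrs = []
    for start in range(0, len(clean) - frame_len + 1, frame_len):
        s = clean[start:start + frame_len]
        e = enhanced[start:start + frame_len]
        signal_energy = sum(v * v for v in s)
        error_energy = sum((v - w) ** 2 for v, w in zip(s, e))
        if signal_energy == 0.0:
            continue  # skip silent frames, which have no defined SNR
        if error_energy == 0.0:
            snr = hi  # perfect reconstruction: use the upper clamp
        else:
            snr = 10.0 * math.log10(signal_energy / error_energy)
        frame_snrs.append(min(hi, max(lo, snr)))
    if not frame_snrs:
        raise ValueError("no usable frames")
    return sum(frame_snrs) / len(frame_snrs)


# Two frames of four samples: one with a small error, one reconstructed exactly.
clean = [1.0] * 8
enhanced = [0.9] * 4 + [1.0] * 4
print(seg_snr_db(clean, enhanced, frame_len=4))
```

Averaging in the dB domain weights quiet and loud frames equally, which is why SegSNR often tracks perceived quality more closely than a single global SNR.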

Image Quality Enhancing by Efficient Histogram Equalization

Wasit Journal of Engineering Sciences ◽

10.31185/ejuow.vol2.iss2.29 ◽

2014 ◽

Vol 2 (2) ◽

pp. 47-58

Author(s):

Ismail Sh. Baqer

Keyword(s):

Image Quality ◽

Contrast Enhancement ◽

Mean Squared Error ◽

Signal To Noise Ratio ◽

Histogram Equalization ◽

Gray Level ◽

Signal To Noise ◽

Squared Error ◽

Noise Ratio

A two-level image quality enhancement is proposed in this paper. In the first level, the Dualistic Sub-Image Histogram Equalization (DSIHE) method decomposes the original image into two sub-images based on the median of the original image. The second level deals with spike-shaped noise that may appear in the image after processing. We present three methods of image enhancement, GHE, LHE and the proposed DSIHE, that improve the visual quality of images. Comparative calculations are carried out on these techniques to examine objective and subjective image quality parameters, e.g. Peak Signal-to-Noise Ratio (PSNR), entropy (H) and mean squared error (MSE), to measure the quality of enhanced gray-scale images. For gray-level images, conventional histogram equalization methods such as GHE and LHE tend to shift the mean brightness of an image to the middle of the gray-level range, limiting their appropriateness for contrast enhancement in consumer electronics such as TV monitors. The DSIHE method seems to overcome this disadvantage, as it tends to preserve both brightness and contrast enhancement. Experimental results show that the proposed technique gives better results in terms of discrete entropy, signal-to-noise ratio and mean squared error than the global and local histogram equalization methods.
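The first level, splitting at the median and equalizing each sub-image within its own gray-level range, can be sketched as follows. This is a minimal illustration on a flat list of 8-bit pixel values; it does not include the second-level spike-noise handling, and the function name and rounding choices are assumptions rather than the paper's implementation.

```python
def dsihe(pixels, levels=256):
    """Median-split histogram equalization of a flat list of gray levels."""
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1

    # Median gray level: smallest level whose cumulative count reaches half.
    half, running, median = len(pixels) / 2.0, 0, 0
    for g in range(levels):
        running += hist[g]
        if running >= half:
            median = g
            break

    def equalize(lo, hi):
        """Map levels lo..hi back onto lo..hi using the sub-histogram's CDF."""
        total = sum(hist[lo:hi + 1])
        mapping, cumulative = {}, 0
        for g in range(lo, hi + 1):
            cumulative += hist[g]
            cdf = cumulative / total if total else 0.0
            mapping[g] = lo + round(cdf * (hi - lo))
        return mapping

    # The dark sub-image stays in [0, median]; the bright one in (median, levels-1].
    lut = equalize(0, median)
    lut.update(equalize(median + 1, levels - 1))
    return [lut[p] for p in pixels]


print(dsihe([10, 10, 20, 20, 200, 200, 210, 210]))
```

Because each half is stretched only within its own range, dark pixels cannot be mapped above the median, which is the mechanism by which the mean brightness tends to be preserved.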

Singular Values Decomposition and Lifting Wavelet Transform for Speech Signal Embedding into Digital Image

Recent Advances in Electrical & Electronic Engineering (Formerly Recent Patents on Electrical & Electronic Engineering) ◽

10.2174/2352096511666180511151646 ◽

2019 ◽

Vol 12 (2) ◽

pp. 138-151

Author(s):

Mourad Talbi ◽

Med Salim Bouhlel

Keyword(s):

Wavelet Transform ◽

Speech Signal ◽

Signal To Noise Ratio ◽

Perceptual Quality ◽

Lifting Wavelet Transform ◽

Signal To Noise ◽

Perceptual Evaluation ◽

Lifting Wavelet ◽

Noise Ratio

Background: In this paper, we propose a secure image watermarking technique which is applied to grayscale and color images. It consists of applying the SVD (Singular Value Decomposition) in the Lifting Wavelet Transform domain to embed a speech image (the watermark) into the host image. Methods: It also uses a signature in the embedding and extraction steps. Its performance is justified by computing the PSNR (Peak Signal to Noise Ratio), SSIM (Structural Similarity), SNR (Signal to Noise Ratio), SegSNR (Segmental SNR) and PESQ (Perceptual Evaluation of Speech Quality). Results: The PSNR and SSIM are used for evaluating the perceptual quality of the watermarked image compared to the original image. The SNR, SegSNR and PESQ are used for evaluating the perceptual quality of the reconstructed or extracted speech signal compared to the original speech signal. Conclusion: The results obtained from the computation of PSNR, SSIM, SNR, SegSNR and PESQ show the performance of the proposed technique.
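Of the measures listed, PSNR is the simplest to state: ten times the base-10 logarithm of the squared peak value over the mean squared error between the two images. The sketch below assumes 8-bit images flattened to lists, with a peak value of 255; the function name is hypothetical.

```python
import math


def psnr_db(original, distorted, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel lists."""
    if not original or len(original) != len(distorted):
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((a - b) ** 2 for a, b in zip(original, distorted)) / len(original)
    if mse == 0.0:
        return math.inf  # identical images
    return 10.0 * math.log10(peak * peak / mse)


# A watermarked image that differs from the host by one gray level per pixel.
print(psnr_db([100, 100], [101, 99]))
```

A higher PSNR means the watermarked image is closer to the host image, so it serves as a proxy for imperceptibility of the watermark.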

A New Weighted Loss for Single Channel Speech Enhancement under Low Signal-to-Noise Ratio Environment

2020 15th IEEE International Conference on Signal Processing (ICSP) ◽

10.1109/icsp48669.2020.9320989 ◽

2020 ◽

Author(s):

Jian Xiao ◽

Hongqing Liu ◽

Yi Zhou ◽

Zhen Luo

Keyword(s):

Speech Enhancement ◽

Single Channel ◽

Signal To Noise Ratio ◽

Signal To Noise ◽

Noise Ratio

Multilag Correlation Estimators for Polarimetric Radar Measurements in the Presence of Noise

Journal of Atmospheric and Oceanic Technology ◽

10.1175/jtech-d-11-00010.1 ◽

2012 ◽

Vol 29 (6) ◽

pp. 772-795 ◽

Cited By ~ 24

Author(s):

Lei Lei ◽

Guifu Zhang ◽

Richard J. Doviak ◽

Robert Palmer ◽

Boon Leng Cheong ◽

...

Keyword(s):

Theoretical Analysis ◽

Data Quality ◽

Signal To Noise Ratio ◽

Simulated Data ◽

Radar Data ◽

Polarimetric Radar ◽

Signal To Noise ◽

Spectral Moments ◽

Radar Measurements

The quality of polarimetric radar data degrades as the signal-to-noise ratio (SNR) decreases. This substantially limits the usage of collected polarimetric radar data to high SNR regions. To improve data quality at low SNRs, multilag correlation estimators are introduced. The performance of the multilag estimators for spectral moments and polarimetric parameters is examined through a theoretical analysis and by the use of simulated data. The biases and standard deviations of the estimates are calculated and compared with those estimates obtained using the conventional method.

Encryption Based Image Watermarking Algorithm in 2DWT-DCT Domains

Sensors ◽

10.3390/s21165540 ◽

2021 ◽

Vol 21 (16) ◽

pp. 5540

Author(s):

Nayeem Hasan ◽

Md Saiful Islam ◽

Wenyu Chen ◽

Muhammad Ashad Kabir ◽

Saad Al-Ahmadi

Keyword(s):

Signal To Noise Ratio ◽

Image Watermarking ◽

Similarity Index ◽

Structural Similarity ◽

Reconstructed Image ◽

Discrete Wavelet ◽

Signal To Noise ◽

Noise Ratio ◽

Watermarking Scheme

This paper proposes an encryption-based image watermarking scheme using a combination of second-level discrete wavelet transform (2DWT) and discrete cosine transform (DCT) with an auto extraction feature. The 2DWT has been selected based on the analysis of the trade-off between imperceptibility of the watermark and embedding capacity at various levels of decomposition. DCT operation is applied to the selected area to gather the image coefficients into a single vector using a zig-zag operation. We have utilized the same random bit sequence as the watermark and seed for the embedding zone coefficient. The quality of the reconstructed image was measured according to bit correction rate, peak signal-to-noise ratio (PSNR), and similarity index. Experimental results demonstrated that the proposed scheme is highly robust under different types of image-processing attacks. Several image attacks, e.g., JPEG compression, filtering, noise addition, cropping, sharpening, and bit-plane removal, were examined on watermarked images, and the results of our proposed method outstripped existing methods, especially in terms of the bit correction ratio (100%), which is a measure of bit restoration. The results were also highly satisfactory in terms of the quality of the reconstructed image, which demonstrated high imperceptibility in terms of peak signal-to-noise ratio (PSNR ≥ 40 dB) and structural similarity (SSIM ≥ 0.9) under different image attacks.
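Gathering a block of transform coefficients into a single vector is commonly done with a zig-zag scan along the anti-diagonals. The sketch below assumes the JPEG-style ordering on a square block; the abstract does not specify the exact traversal, so treat this as one plausible reading.

```python
def zigzag(block):
    """Flatten a square 2-D block into a list in JPEG-style zig-zag order."""
    n = len(block)
    out = []
    for d in range(2 * n - 1):
        # Cells on anti-diagonal d satisfy row + col == d.
        rows = range(max(0, d - n + 1), min(d, n - 1) + 1)
        cells = [(r, d - r) for r in rows]
        if d % 2 == 0:
            cells.reverse()  # even diagonals run from bottom-left to top-right
        out.extend(block[r][c] for r, c in cells)
    return out


coefficients = [
    [1, 2, 3],
    [4, 5, 6],
    [7, 8, 9],
]
print(zigzag(coefficients))
```

For DCT output this ordering places low-frequency coefficients first, which makes it straightforward to select an embedding zone by index range in the resulting vector.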

Joint estimation of multi‐target signal‐to‐noise ratio and dynamic states in cluttered environment

IET Radar Sonar & Navigation ◽

10.1049/iet-rsn.2016.0416 ◽

2017 ◽

Vol 11 (3) ◽

pp. 539-549 ◽

Cited By ~ 3

Author(s):

Seung‐Hwan Bae ◽

Jongyoul Park ◽

Kuk‐Jin Yoon

Keyword(s):

Signal To Noise Ratio ◽

Joint Estimation ◽

Signal To Noise ◽

Target Signal ◽

Cluttered Environment ◽

Noise Ratio ◽

Dynamic States

Improving the tracking quality of the weld seam butt with V-form grooving by using Kalman filter and neural network

Automation. Modern Technologies ◽

10.36652/0869-4931-2021-75-11-500-509 ◽

2021 ◽

Author(s):

Keyword(s):

Neural Network ◽

Kalman Filter ◽

Colored Noise ◽

Weld Seam ◽

Vision Sensor ◽

Laser Vision Sensor ◽

Characteristic Points ◽

Laser Vision ◽

Weld Seams

An algorithm is proposed for tracking the grooving of welded seams using a Kalman filter, based on six characteristic points of the profile obtained with the RF627 laser vision sensor. To reduce the error in weld seam control, a multilayer neural network with a backpropagation algorithm is created to compensate for errors caused by colored noise when using the Kalman filter. Experimental results show that when the algorithm is applied, the error in tracking the trajectory of weld seams is reduced. Keywords: tracking of weld seams; multilayer/multi-pass welding; Kalman filter; multilayer perceptron

The Influence of Alteration of kVp and mAs towards the Image Quality of Acrylic Using CBCT

Key Engineering Materials ◽

10.4028/www.scientific.net/kem.829.252 ◽

2019 ◽

Vol 829 ◽

pp. 252-257

Author(s):

Azhari ◽

Yohanes Hutasoit ◽

Freddy Haryanto

Keyword(s):

Image Quality ◽

Signal To Noise Ratio ◽

Region Of Interest ◽

Signal To Noise ◽

Bite Block ◽

Significant Difference ◽

Noise Ratio ◽

Post Hoc ◽

Radiograph Image

CBCT is a modern technology for producing radiographic images in dentistry. Excellent image quality is very important for clinicians interpreting the image, so that the resulting diagnosis is more accurate and appropriate, thus minimizing working time. This research aimed to assess image quality using a blank acrylic phantom of polymethylmethacrylate (PMMA), (C5H8O2)n, with a density of 1.185 g/cm3, to evaluate the homogeneity and uniformity of the image produced. The acrylic phantom was supported with a tripod and laid on the chin rest of the CBCT device; the phantom was then fixed, with its edge touching the bite block. The acrylic phantom was then exposed to X-rays at various kVp and mAs settings: the tube voltage ranged from 80 to 90 in steps of 5 kV, and the tube current was 3, 5, and 7 mA. The exposure time was kept constant at 25 seconds. Samples were taken from the CBCT acrylic images, and 5 ROIs (Regions of Interest) were chosen for analysis. The ROIs were analyzed using the ImageJ® software to determine the influence of kVp and mAs on image uniformity, noise and SNR. The lowest kVp and mAs gave uniformity, homogeneity and signal-to-noise ratio values of 11.22, 40.35, and 5.96 respectively. Meanwhile, the highest kVp and mAs gave uniformity, homogeneity and signal-to-noise ratio values of 16.96, 26.20, and 5.95 respectively. There were significant differences in image uniformity and homogeneity between the lowest and the highest kVp and mAs, as analyzed with ANOVA followed by a Student's t post-hoc test with α = 0.05. However, there was no significant difference in SNR as analyzed with ANOVA. The use of higher kVp and mAs improved image homogeneity and uniformity compared to lower kVp and mAs.
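The abstract does not state how SNR was computed from each ROI. One common convention for a uniform phantom, assumed here, is the mean pixel value of the ROI divided by its standard deviation; the function name and sample values are hypothetical.

```python
import statistics


def roi_snr(pixels):
    """SNR of a uniform region: mean pixel value over its standard deviation."""
    noise = statistics.pstdev(pixels)  # population standard deviation of the ROI
    if noise == 0:
        raise ValueError("ROI has zero variance; SNR is undefined")
    return statistics.mean(pixels) / noise


# Hypothetical gray values sampled from one ROI of a phantom image.
roi = [2, 4, 4, 4, 5, 5, 7, 9]
print(roi_snr(roi))
```

Under this definition a higher tube setting raises SNR only if the mean signal grows faster than the noise.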
