scholarly journals Dual and joint estimation for speech enhancement

2018 ◽  
Vol 7 (2.7) ◽  
pp. 5
Author(s):  
V Gopi Tilak ◽  
S Koteswara Rao

Maintaining good quality and intelligibility of speech is the primary constraint in mobile communications. The present work is on the enhancement of speech under the consideration of additive white and colored noise environments using Kalman filter. Dual and Joint estimation techniques were applied and the quality of speech is analyzed through the signal to noise ratio. The techniques were applied in both ideal and practical cases for two different speech samples.

2012 ◽  
Vol 239-240 ◽  
pp. 1274-1278
Author(s):  
Guang Yan Wang ◽  
Yan Xiang Geng ◽  
Xiao Qun Zhao

In this paper, we propose a speech enhancement technique in terms of subspace methods to reduce the white or colored noise in strong background noise environment. This subspace approach based on Karhunen-Loève transform (KLT) and implemented via Principal Component Analysis (PCA). The subspace selection provided by the minimum description length (MDL) criterion. An offset factor generated from the white noise was used to modify the variance to adapt to the specified colored noise. The objective speech quality measures SegSNR have been introduced to evaluate the performance of the proposed method in time domain. A large amount of data and figures testify that our algorithm provides high performance for a large scale of input signal-to-noise ratio (-5~10dB). The performance of our algorithm is assessed in white and colored noise.


2014 ◽  
Vol 2 (2) ◽  
pp. 47-58
Author(s):  
Ismail Sh. Baqer

A two Level Image Quality enhancement is proposed in this paper. In the first level, Dualistic Sub-Image Histogram Equalization DSIHE method decomposes the original image into two sub-images based on median of original images. The second level deals with spikes shaped noise that may appear in the image after processing. We presents three methods of image enhancement GHE, LHE and proposed DSIHE that improve the visual quality of images. A comparative calculations is being carried out on above mentioned techniques to examine objective and subjective image quality parameters e.g. Peak Signal-to-Noise Ratio PSNR values, entropy H and mean squared error MSE to measure the quality of gray scale enhanced images. For handling gray-level images, convenient Histogram Equalization methods e.g. GHE and LHE tend to change the mean brightness of an image to middle level of the gray-level range limiting their appropriateness for contrast enhancement in consumer electronics such as TV monitors. The DSIHE methods seem to overcome this disadvantage as they tend to preserve both, the brightness and contrast enhancement. Experimental results show that the proposed technique gives better results in terms of Discrete Entropy, Signal to Noise ratio and Mean Squared Error values than the Global and Local histogram-based equalization methods


Author(s):  
Mourad Talbi ◽  
Med Salim Bouhlel

Background: In this paper, we propose a secure image watermarking technique which is applied to grayscale and color images. It consists in applying the SVD (Singular Value Decomposition) in the Lifting Wavelet Transform domain for embedding a speech image (the watermark) into the host image. Methods: It also uses signature in the embedding and extraction steps. Its performance is justified by the computation of PSNR (Pick Signal to Noise Ratio), SSIM (Structural Similarity), SNR (Signal to Noise Ratio), SegSNR (Segmental SNR) and PESQ (Perceptual Evaluation Speech Quality). Results: The PSNR and SSIM are used for evaluating the perceptual quality of the watermarked image compared to the original image. The SNR, SegSNR and PESQ are used for evaluating the perceptual quality of the reconstructed or extracted speech signal compared to the original speech signal. Conclusion: The Results obtained from computation of PSNR, SSIM, SNR, SegSNR and PESQ show the performance of the proposed technique.


2012 ◽  
Vol 29 (6) ◽  
pp. 772-795 ◽  
Author(s):  
Lei Lei ◽  
Guifu Zhang ◽  
Richard J. Doviak ◽  
Robert Palmer ◽  
Boon Leng Cheong ◽  
...  

Abstract The quality of polarimetric radar data degrades as the signal-to-noise ratio (SNR) decreases. This substantially limits the usage of collected polarimetric radar data to high SNR regions. To improve data quality at low SNRs, multilag correlation estimators are introduced. The performance of the multilag estimators for spectral moments and polarimetric parameters is examined through a theoretical analysis and by the use of simulated data. The biases and standard deviations of the estimates are calculated and compared with those estimates obtained using the conventional method.


Sensors ◽  
2021 ◽  
Vol 21 (16) ◽  
pp. 5540
Author(s):  
Nayeem Hasan ◽  
Md Saiful Islam ◽  
Wenyu Chen ◽  
Muhammad Ashad Kabir ◽  
Saad Al-Ahmadi

This paper proposes an encryption-based image watermarking scheme using a combination of second-level discrete wavelet transform (2DWT) and discrete cosine transform (DCT) with an auto extraction feature. The 2DWT has been selected based on the analysis of the trade-off between imperceptibility of the watermark and embedding capacity at various levels of decomposition. DCT operation is applied to the selected area to gather the image coefficients into a single vector using a zig-zig operation. We have utilized the same random bit sequence as the watermark and seed for the embedding zone coefficient. The quality of the reconstructed image was measured according to bit correction rate, peak signal-to-noise ratio (PSNR), and similarity index. Experimental results demonstrated that the proposed scheme is highly robust under different types of image-processing attacks. Several image attacks, e.g., JPEG compression, filtering, noise addition, cropping, sharpening, and bit-plane removal, were examined on watermarked images, and the results of our proposed method outstripped existing methods, especially in terms of the bit correction ratio (100%), which is a measure of bit restoration. The results were also highly satisfactory in terms of the quality of the reconstructed image, which demonstrated high imperceptibility in terms of peak signal-to-noise ratio (PSNR ≥ 40 dB) and structural similarity (SSIM ≥ 0.9) under different image attacks.


Author(s):  

An algorithm for tracking of the welded seams grooving by using a Kalman filter based on six characteristic points of the profile obtained using the RF627 laser vision sensor is proposed. In order to reduce the error in weld seams control, a multilayer neural network with a backpropagation algorithm is created to compensate for errors caused by colored noise when using the Kalman filter. Experimental results show that when the algorithm is applied, the error in tracking the trajectory of weld seams is reduced. Keywords tracking of weld seams; multilayer/multi-pass welding; Kalman filter; multilayer perceptron


2019 ◽  
Vol 829 ◽  
pp. 252-257
Author(s):  
Azhari ◽  
Yohanes Hutasoit ◽  
Freddy Haryanto

CBCT is a modernized technology in producing radiograph image on dentistry. The image quality excellence is very important for clinicians to interpret the image, so the result of diagnosis produced becoming more accurate, appropriate, thus minimizing the working time. This research was aimed to assess the image quality using the blank acrylic phantom polymethylmethacrylate (PMMA) (C­5H8O2)n in the density of 1.185 g/cm3 for evaluating the homogeneity and uniformity of the image produced. Acrylic phantom was supported with a tripod and laid down on the chin rest of the CBCT device, then the phantom was fixed, and the edge of the phantom was touched by the bite block. Furthermore, the exposure of the X-ray was executed toward the acrylic phantom with various kVp and mAs, from 80 until 90, with the range of 5 kV and the variation of mA was 3, 5, and 7 mA respectively. The time exposure was kept constant for 25 seconds. The samples were taken from CBCT acrylic images, then as much as 5 ROIs (Region of Interest) was chosen to be analyzed. The ROIs determination was analyzed by using the ImageJ® software for recognizing the influence of kVp and mAs towards the image uniformity, noise and SNR. The lowest kVp and mAs had the result of uniformity value, homogeneity and signal to noise ratio of 11.22; 40.35; and 5.96 respectively. Meanwhile, the highest kVp and mAs had uniformity value, homogeneity and signal to noise ratio of 16.96; 26.20; and 5.95 respectively. There were significant differences between the image uniformity and homogeneity on the lowest kVp and mAs compared to the highest kVp and mAs, as analyzed with the ANOVA statistics analysis continued with the t-student post-hoc test with α = 0.05. However, there was no significant difference in SNR as analyzed with the ANOVA statistic analysis. The usage of the higher kVp and mAs caused the improvement of the image homogeneity and uniformity compared to the lower kVp and mAs.


Sign in / Sign up

Export Citation Format

Share Document