Output-based method of applying PESQ to measure the perceptual quality of framed speech signals

Author(s):  
A.E. Conway
Author(s):  
Mourad Talbi ◽  
Med Salim Bouhlel

Background: In this paper, we propose a secure image watermarking technique which is applied to grayscale and color images. It consists in applying the SVD (Singular Value Decomposition) in the Lifting Wavelet Transform domain for embedding a speech image (the watermark) into the host image. Methods: It also uses signature in the embedding and extraction steps. Its performance is justified by the computation of PSNR (Pick Signal to Noise Ratio), SSIM (Structural Similarity), SNR (Signal to Noise Ratio), SegSNR (Segmental SNR) and PESQ (Perceptual Evaluation Speech Quality). Results: The PSNR and SSIM are used for evaluating the perceptual quality of the watermarked image compared to the original image. The SNR, SegSNR and PESQ are used for evaluating the perceptual quality of the reconstructed or extracted speech signal compared to the original speech signal. Conclusion: The Results obtained from computation of PSNR, SSIM, SNR, SegSNR and PESQ show the performance of the proposed technique.


2021 ◽  
pp. 030573562098727
Author(s):  
Pedro Neto ◽  
Patricia M Vanzella

We report an experiment in which participants ( N = 368) were asked to differentiate between major and minor thirds. These intervals could either be formed by diatonic tones from the C major scale (tonal condition) or by a subset of tones from the chromatic scale (atonal condition). We hypothesized that in the tonal condition intervals would be perceived as a function of scale step distances, which we defined as the number of diatonic leaps between two notes of a given music scale. In the atonal condition, we hypothesized that intervals would be perceived as a function of cents. If our hypotheses were supported, we should verify a less accurate performance in the tonal condition, where scale step distances are the same between major and minor thirds. The data corroborated our hypotheses, and we suggest that acoustic measurements of intervallic distances (i.e., frequency ratios and cents) are not optimal when it comes to describing the perceptual quality of intervals in a tonal context. Finally, our research points to the possibility that, in comparison with previous models, scale steps and cents might better capture the notion of global versus local instances of auditory processing.


2015 ◽  
Vol 22 (4) ◽  
pp. 14-28 ◽  
Author(s):  
Jingxi Xu ◽  
Benjamin W. Wah

2016 ◽  
Vol 34 (1) ◽  
pp. 51-82 ◽  
Author(s):  
Manuela Chessa ◽  
Guido Maiello ◽  
Alessia Borsari ◽  
Peter J. Bex

2016 ◽  
Vol 18 (9) ◽  
pp. 1808-1818 ◽  
Author(s):  
Soobeom Jang ◽  
Jong-Seok Lee
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document