Output-based method of applying PESQ to measure the perceptual quality of framed speech signals

Background: In this paper, we propose a secure image watermarking technique which is applied to grayscale and color images. It consists in applying the SVD (Singular Value Decomposition) in the Lifting Wavelet Transform domain for embedding a speech image (the watermark) into the host image. Methods: It also uses signature in the embedding and extraction steps. Its performance is justified by the computation of PSNR (Pick Signal to Noise Ratio), SSIM (Structural Similarity), SNR (Signal to Noise Ratio), SegSNR (Segmental SNR) and PESQ (Perceptual Evaluation Speech Quality). Results: The PSNR and SSIM are used for evaluating the perceptual quality of the watermarked image compared to the original image. The SNR, SegSNR and PESQ are used for evaluating the perceptual quality of the reconstructed or extracted speech signal compared to the original speech signal. Conclusion: The Results obtained from computation of PSNR, SSIM, SNR, SegSNR and PESQ show the performance of the proposed technique.

Download Full-text

Cents versus scale steps: Can we tell the difference between major and minor thirds?

Psychology of Music ◽

10.1177/0305735620987272 ◽

2021 ◽

pp. 030573562098727

Author(s):

Pedro Neto ◽

Patricia M Vanzella

Keyword(s):

Auditory Processing ◽

Perceptual Quality ◽

Chromatic Scale ◽

Major Scale ◽

Accurate Performance ◽

The Difference ◽

Scale Step ◽

Frequency Ratios ◽

Tonal Context

We report an experiment in which participants ( N = 368) were asked to differentiate between major and minor thirds. These intervals could either be formed by diatonic tones from the C major scale (tonal condition) or by a subset of tones from the chromatic scale (atonal condition). We hypothesized that in the tonal condition intervals would be perceived as a function of scale step distances, which we defined as the number of diatonic leaps between two notes of a given music scale. In the atonal condition, we hypothesized that intervals would be perceived as a function of cents. If our hypotheses were supported, we should verify a less accurate performance in the tonal condition, where scale step distances are the same between major and minor thirds. The data corroborated our hypotheses, and we suggest that acoustic measurements of intervallic distances (i.e., frequency ratios and cents) are not optimal when it comes to describing the perceptual quality of intervals in a tonal context. Finally, our research points to the possibility that, in comparison with previous models, scale steps and cents might better capture the notion of global versus local instances of auditory processing.

Download Full-text