Evaluation of objective video quality assessment methods on video sequences with different spatial and temporal activity encoded at different spatial resolutions

Jelena Vlaović; Drago Žagar; Snježana Rimac-Drlje; Mario Vranješ

doi:10.32985/ijeces.12.1.1

Evaluation of objective video quality assessment methods on video sequences with different spatial and temporal activity encoded at different spatial resolutions

International journal of electrical and computer engineering systems ◽

10.32985/ijeces.12.1.1 ◽

2021 ◽

Vol 12 (1) ◽

pp. 1-9

Author(s):

Jelena Vlaović ◽

Drago Žagar ◽

Snježana Rimac-Drlje ◽

Mario Vranješ

Keyword(s):

Quality Assessment ◽

Signal To Noise Ratio ◽

Video Quality ◽

Similarity Index ◽

Structural Similarity ◽

Video Sequences ◽

Large Set ◽

Signal To Noise ◽

Structural Similarity Index

With the development of Video on Demand applications, enabled by the availability of high-speed internet access, adaptive streaming algorithms have been developing and improving. The focus is on improving the user’s Quality of Experience (QoE) and taking it into account as one of the parameters of the adaptation algorithm. Users often experience changing network conditions, so the goal is to ensure stable video playback with a satisfying QoE level. Although subjective Video Quality Assessment (VQA) methods provide more accurate results regarding the user’s QoE, objective VQA methods cost less and are less time-consuming. In this article, nine different objective VQA methods are compared on a large set of video sequences with various spatial and temporal activities. The VQA methods used in this analysis are: Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Index (SSIM), MultiScale Structural Similarity Index (MS-SSIM), Video Quality Metric (VQM), Mean Sum of Differences (DELTA), Mean Sum of Absolute Differences (MSAD), Mean Squared Error (MSE), Netflix Video Multimethod Assessment Fusion (Netflix VMAF) and Visual Signal-to-Noise Ratio (VSNR). The video sequences used for testing were encoded according to H.264/AVC with twelve different target coding bitrates, at three different spatial resolutions (resulting in a total of 190 sequences). In addition to objective quality assessment, subjective quality assessment was performed for these sequences. All results acquired by the objective VQA methods have been compared with subjective Mean Opinion Score (MOS) results using the Pearson Linear Correlation Coefficient (PLCC). Measurement results obtained on a large set of video sequences with different spatial resolutions show that VQA methods like SSIM and VQM correlate better with MOS results than PSNR, MS-SSIM, VSNR, DELTA, MSE, VMAF and MSAD. However, the PLCC results for SSIM and VQM (0.7799 and 0.7734, respectively) are too low for these methods to be used in streaming services instead of subjective testing. These results suggest that more efficient VQA methods should be developed for use in streaming testing procedures as well as to support the video segmentation process. Furthermore, a comparison of the results obtained for different spatial resolutions shows that, at lower target coding bitrates, video sequences encoded at lower spatial resolutions have higher quality than video sequences encoded at higher spatial resolutions at the same target coding bitrate, particularly for video sequences with higher spatial and temporal information.
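
The comparison against MOS in this abstract rests on the Pearson Linear Correlation Coefficient. As a minimal sketch of that step, assuming one objective score and one MOS value per sequence, it could be computed as below. The function name `plcc` and the sample numbers are illustrative assumptions, not data from the paper.

```python
import math


def plcc(objective_scores, mos_scores):
    """Pearson Linear Correlation Coefficient between two equal-length score lists."""
    if len(objective_scores) != len(mos_scores) or len(objective_scores) < 2:
        raise ValueError("need two lists of equal length with at least two values")
    n = len(objective_scores)
    mean_x = sum(objective_scores) / n
    mean_y = sum(mos_scores) / n
    # Covariance and variance sums (the common 1/n factors cancel in the ratio).
    cov = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(objective_scores, mos_scores))
    var_x = sum((x - mean_x) ** 2 for x in objective_scores)
    var_y = sum((y - mean_y) ** 2 for y in mos_scores)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical placeholder values: SSIM-like scores and MOS for five sequences.
example_ssim = [0.91, 0.85, 0.78, 0.95, 0.70]
example_mos = [4.1, 3.6, 3.0, 4.5, 2.4]
print(f"PLCC = {plcc(example_ssim, example_mos):.4f}")
```

A value close to 1 or -1 indicates a strong linear relationship with the subjective scores; the sign depends on whether the metric grows or shrinks as quality improves.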

Sensitivity Analysis of MPEG-4 Video Based on Frame Structure in DVB-T Transmission (Analisis Sensitivitas Video MPEG-4 Berdasarkan Struktur Frame pada Transmisi DVB-T)

Jurnal Ilmiah Informatika Komputer ◽

10.35760/ik.2020.v25i2.2691 ◽

2020 ◽

Vol 25 (2) ◽

pp. 86-97

Author(s):

Sandy Suryo Prayogo ◽

Tubagus Maulana Kusuma

Keyword(s):

Deep Learning ◽

Bit Error Rate ◽

Error Rate ◽

Signal To Noise Ratio ◽

Similarity Index ◽

Structural Similarity ◽

Signal To Noise ◽

Structural Similarity Index ◽

Noise Ratio

DVB is currently the most widely used digital television transmission standard. The most important element of a transmission process is the picture quality of the video received after it has passed through that process. Many factors can affect picture quality, one of which is the frame structure of the video. This paper tests the sensitivity of MPEG-4 video to frame structure in DVB-T transmission. The tests were carried out by simulation in MATLAB and Simulink; ffmpeg was also used to prepare the format and settings of the video to be simulated. The video variables that were varied are the bitrate and the group of pictures (GOP), while the DVB-T transmission variable that was varied is the signal-to-noise ratio (SNR) on the AWGN channel between the transmitter (Tx) and the receiver (Rx). The results of the experiments are the average picture quality of the video, measured with the structural similarity index (SSIM). The bit error rate (BER) of the DVB-T bitstream was also measured. The experiments show how sensitive the video's bitrate and GOP are in DVB-T transmission, with the conclusion that the higher the bitrate, the worse the picture quality, and the smaller the GOP value, the better the quality. It is hoped that this research can be extended using deep learning to obtain the appropriate frame structure under particular conditions in the digital television transmission process.
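
The two channel-side quantities in this abstract, the SNR of an AWGN channel and the resulting BER, can be illustrated with a small simulation. This is only a sketch under strong simplifying assumptions: it uses plain BPSK symbols instead of the DVB-T modulation chain, defines SNR as signal power over noise variance, and the names `transmit_bpsk_awgn` and `bit_error_rate` are hypothetical.

```python
import math
import random


def bit_error_rate(sent_bits, received_bits):
    """Fraction of bit positions where the received bit differs from the sent bit."""
    if len(sent_bits) != len(received_bits) or not sent_bits:
        raise ValueError("bit sequences must be non-empty and of equal length")
    errors = sum(1 for s, r in zip(sent_bits, received_bits) if s != r)
    return errors / len(sent_bits)


def transmit_bpsk_awgn(bits, snr_db, rng):
    """Map bits to +/-1 symbols, add Gaussian noise for the given SNR, decide by sign."""
    symbols = [1.0 if b else -1.0 for b in bits]   # unit signal power
    noise_power = 1.0 / (10.0 ** (snr_db / 10.0))  # SNR = signal power / noise power
    sigma = math.sqrt(noise_power)
    noisy = [s + rng.gauss(0.0, sigma) for s in symbols]
    return [1 if value > 0.0 else 0 for value in noisy]


rng = random.Random(0)  # fixed seed so repeated runs give the same numbers
bits = [rng.randint(0, 1) for _ in range(10_000)]
for snr_db in (0.0, 5.0, 10.0):
    received = transmit_bpsk_awgn(bits, snr_db, rng)
    print(f"SNR {snr_db:4.1f} dB -> BER {bit_error_rate(bits, received):.4f}")
```

In this toy setting the BER falls as the SNR rises, which is the channel-side behaviour the study varies while measuring SSIM on the decoded video.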

Encryption Based Image Watermarking Algorithm in 2DWT-DCT Domains

Sensors ◽

10.3390/s21165540 ◽

2021 ◽

Vol 21 (16) ◽

pp. 5540

Author(s):

Nayeem Hasan ◽

Md Saiful Islam ◽

Wenyu Chen ◽

Muhammad Ashad Kabir ◽

Saad Al-Ahmadi

Keyword(s):

Signal To Noise Ratio ◽

Image Watermarking ◽

Similarity Index ◽

Structural Similarity ◽

Reconstructed Image ◽

Discrete Wavelet ◽

Signal To Noise ◽

Noise Ratio ◽

Watermarking Scheme

This paper proposes an encryption-based image watermarking scheme using a combination of second-level discrete wavelet transform (2DWT) and discrete cosine transform (DCT) with an auto-extraction feature. The 2DWT has been selected based on an analysis of the trade-off between imperceptibility of the watermark and embedding capacity at various levels of decomposition. The DCT operation is applied to the selected area to gather the image coefficients into a single vector using a zig-zag operation. We have utilized the same random bit sequence as the watermark and as the seed for the embedding zone coefficient. The quality of the reconstructed image was measured according to bit correction rate, peak signal-to-noise ratio (PSNR), and similarity index. Experimental results demonstrated that the proposed scheme is highly robust under different types of image-processing attacks. Several image attacks, e.g., JPEG compression, filtering, noise addition, cropping, sharpening, and bit-plane removal, were examined on watermarked images, and the results of the proposed method outperformed existing methods, especially in terms of the bit correction ratio (100%), which is a measure of bit restoration. The results were also highly satisfactory in terms of the quality of the reconstructed image, which demonstrated high imperceptibility in terms of peak signal-to-noise ratio (PSNR ≥ 40 dB) and structural similarity (SSIM ≥ 0.9) under different image attacks.
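
Gathering block coefficients into a single vector by a zig-zag scan can be sketched as follows. The abstract does not give the scan order, so this assumes the JPEG-style convention (anti-diagonals traversed in alternating directions, starting at the top-left coefficient); the function name `zigzag_scan` is hypothetical.

```python
def zigzag_scan(block):
    """Flatten a square 2-D block into a list in JPEG-style zig-zag order."""
    n = len(block)
    if any(len(row) != n for row in block):
        raise ValueError("block must be square")

    def sort_key(position):
        row, col = position
        diagonal = row + col
        # Odd anti-diagonals run top-right to bottom-left (row increasing),
        # even anti-diagonals run bottom-left to top-right (row decreasing).
        return (diagonal, row if diagonal % 2 else -row)

    order = sorted(((r, c) for r in range(n) for c in range(n)), key=sort_key)
    return [block[r][c] for r, c in order]


# Toy 3x3 block standing in for a block of DCT coefficients.
toy_block = [
    [1, 2, 3],
    [4, 5, 6],
    [7, 8, 9],
]
print(zigzag_scan(toy_block))  # → [1, 2, 4, 7, 5, 3, 6, 8, 9]
```

With this ordering, low-frequency coefficients (near the top-left corner) come first in the vector, which is why the scan is commonly paired with the DCT.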

Is there a relationship between peak-signal-to-noise ratio and structural similarity index measure?

IET Image Processing ◽

10.1049/iet-ipr.2012.0489 ◽

2013 ◽

Vol 7 (1) ◽

pp. 12-24 ◽

Cited By ~ 21

Author(s):

Alain Horé ◽

Djemel Ziou

Keyword(s):

Signal To Noise Ratio ◽

Similarity Index ◽

Structural Similarity ◽

Signal To Noise ◽

Structural Similarity Index ◽

Noise Ratio ◽

Index Measure

A Cryptographic based Approach for Data Hiding in Advanced Video Sequences

Turkish Journal of Computer and Mathematics Education (TURCOMAT) ◽

10.17762/turcomat.v12i6.4808 ◽

2021 ◽

Vol 12 (6) ◽

pp. 2031-2038

Author(s):

Vinay D R ◽

Anand Babu J

Keyword(s):

Data Hiding ◽

Signal To Noise Ratio ◽

Video Quality ◽

Similarity Index ◽

Data Communication ◽

Structural Similarity ◽

Video Stream ◽

Video Sequences ◽

Video Streams ◽

Hiding Capacity

Data hiding in video streams has become more popular because of the high volume of data communication over the internet. Hiding data in video streams provides more security and a higher embedding capacity than hiding it in images. As the quantity of information embedded into the video increases, it can badly degrade the video quality and make the video unsuitable for certain applications. The main concerns in data hiding in videos are high visual quality, increased hiding capacity, video stream size, etc. In this paper, a new data hiding technique is proposed for compressed H.264 video streams. First, the information to be embedded is encrypted using a cryptographic approach, which encrypts the plain information based on the elliptic points produced by choosing a large prime number. The encrypted data is embedded into the transformed DCT coefficients of I, B and P video frames. The experiment is conducted on different sets of video sequences. The results show that the proposed method yields better performance in terms of peak signal-to-noise ratio (PSNR), structural similarity index (SSIM) and video quality measure (VQM) when compared to existing methods.

A Study of 3G Network Performance Using Video Quality Assessment: Bangkok Case Study

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.781.93 ◽

2015 ◽

Vol 781 ◽

pp. 93-97

Author(s):

Natradee Anupong ◽

Patsita Sirawongphatsara ◽

Pongpisit Wuttidittachotti ◽

Therdpong Daengsi

Keyword(s):

Quality Assessment ◽

Network Performance ◽

Service Providers ◽

Signal To Noise Ratio ◽

Video Quality ◽

Similarity Index ◽

Structural Similarity ◽

Assessment Method ◽

Video Quality Assessment ◽

3G Network

This paper presents a study of 3G network performance in the inner city of Bangkok using a video quality assessment method called Full-Reference Assessment. From the study, Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index (SSIM) values have been obtained. The results show that the 3G networks of two 3G service providers tend to provide better video quality than those of the other 3G service providers, which is consistent with the speed test results.

Comparative analysis between non-linear wavelet based image denoising techniques

Journal of Science and Engineering ◽

10.3126/jsce.v5i0.22373 ◽

2018 ◽

Vol 5 ◽

pp. 58-67

Author(s):

Milan Chikanbanjar

Keyword(s):

Visual Information ◽

Signal To Noise Ratio ◽

Similarity Index ◽

Structural Similarity ◽

Hard Thresholding ◽

Structural Similarity Index ◽

Thresholding Method ◽

Non Linear ◽

Good Quality Image

Digital images have been a major form of transmission of visual information, but the presence of noise corrupts the image. Thus, the received image needs to be processed before it is used in an application. Image denoising manipulates the data to remove noise and produce a good-quality image that retains its details. Quantitative measures, mainly MSE (Mean Square Error), PSNR (Peak Signal-to-Noise Ratio) and SSIM (Structural Similarity Index), have been used to show the improvement in the quality of the restored image under various thresholding techniques. Here, non-linear wavelet transform denoising techniques for natural images are studied, analyzed and compared using thresholding techniques such as soft, hard, semi-soft, LevelShrink, SUREShrink, VisuShrink and BayesShrink. In most of the tests, the PSNR and SSIM values for the LevelShrink Hard thresholding method are higher than those of the other thresholding methods. For instance, the PSNR and SSIM values of the Lena image for the VisuShrink Hard, VisuShrink Soft, VisuShrink Semi-Soft, LevelShrink Hard, LevelShrink Soft, LevelShrink Semi-Soft, SUREShrink and BayesShrink thresholding methods at a variance of 10 are 23.82, 16.51, 23.25, 24.48, 23.25, 20.67, 23.42, 23.14 and 0.28, 0.28, 0.28, 0.29, 0.22, 0.25, 0.16, respectively, which shows that the PSNR and SSIM values for the LevelShrink Hard thresholding method are higher than those of the other thresholding methods. Thus, it can be stated that the LevelShrink Hard thresholding method performs better in most of the tests.
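
The hard and soft thresholding rules compared in this abstract can be sketched on a list of wavelet coefficients as follows. These are the textbook definitions; how the threshold itself is chosen (VisuShrink, SUREShrink, BayesShrink, and so on) is not shown, and the function names and sample coefficients are illustrative assumptions.

```python
def hard_threshold(coefficients, threshold):
    """Keep coefficients whose magnitude exceeds the threshold, zero the rest."""
    return [c if abs(c) > threshold else 0.0 for c in coefficients]


def soft_threshold(coefficients, threshold):
    """Zero small coefficients and shrink the remaining ones toward zero."""
    shrunk = []
    for c in coefficients:
        if abs(c) <= threshold:
            shrunk.append(0.0)
        elif c > 0:
            shrunk.append(c - threshold)
        else:
            shrunk.append(c + threshold)
    return shrunk


# Hypothetical detail coefficients and threshold, for illustration only.
detail = [-3.0, -0.5, 0.2, 2.0]
print(hard_threshold(detail, 1.0))  # → [-3.0, 0.0, 0.0, 2.0]
print(soft_threshold(detail, 1.0))  # → [-2.0, 0.0, 0.0, 1.0]
```

Hard thresholding leaves the surviving coefficients untouched, while soft thresholding also reduces their magnitude by the threshold, which tends to give smoother results at the cost of some detail.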

Temporal video quality assessment method involving structural similarity index

2016 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW) ◽

10.1109/icce-tw.2016.7520921 ◽

2016 ◽

Cited By ~ 1

Author(s):

Woei-Tan Loh ◽

David B. L. Bong

Keyword(s):

Quality Assessment ◽

Video Quality ◽

Similarity Index ◽

Structural Similarity ◽

Assessment Method ◽

Video Quality Assessment ◽

Structural Similarity Index ◽

Quality Assessment Method

Image Restoration Using Lucy Richardson Algorithm for Deblurring Images with Improved PSNR, SSIM, NC in Comparison with Wiener Filter

Alinteri Journal of Agricultural Sciences ◽

10.47059/alinteri/v36i1/ajas21091 ◽

2021 ◽

Vol 36 (1) ◽

pp. 642-649

Author(s):

G. Sharvani Reddy ◽

R. Nanmaran ◽

Gokul Paramasivam

Keyword(s):

Image Restoration ◽

Signal To Noise Ratio ◽

Wiener Filter ◽

Similarity Index ◽

Structural Similarity ◽

Structural Similarity Index ◽

Degraded Image ◽

Restoration Algorithms ◽

Index Measure

Aim: Images are a powerful tool for analyzing information. Sometimes the captured image is affected by blur and noise in the environment, which degrades its quality. Image restoration is a technique in image processing by which a degraded image is restored or recovered to a close approximation of the original image. Materials and Methods: In this research, the Lucy-Richardson algorithm is used for restoring blurred and noisy images using MATLAB software. The proposed work is compared with the Wiener filter, and the sample size for each group is 30. Results: The performance was compared based on three parameters: Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Index Measure (SSIM) and Normalized Correlation (NC). High values of PSNR, SSIM and NC indicate better performance of a restoration algorithm. Lucy-Richardson provides a mean PSNR of 10.4086 dB, a mean SSIM of 0.4173% and an NC of 0.7433%, while the Wiener filter provides a mean PSNR of 6.3979 dB, an SSIM of 0.3016% and an NC of 0.3276%. Conclusion: Based on the experimental results and statistical analysis using an independent-samples t-test, image restoration using the Lucy-Richardson algorithm performs significantly better than the Wiener filter at restoring the degraded image in terms of PSNR (P<0.001) and SSIM (P<0.001).
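
PSNR, the first of the three measures in this abstract, follows directly from the mean squared error between the original and the restored image. The sketch below uses the standard definition, assuming 8-bit pixel values (peak 255) stored as flat lists; the function names and pixel values are illustrative, not taken from the study.

```python
import math


def mean_squared_error(reference, restored):
    """Mean squared difference between two equal-length pixel sequences."""
    if len(reference) != len(restored) or not reference:
        raise ValueError("pixel sequences must be non-empty and of equal length")
    return sum((a - b) ** 2 for a, b in zip(reference, restored)) / len(reference)


def psnr_db(reference, restored, peak=255.0):
    """PSNR in dB: 10 * log10(peak^2 / MSE); infinite for identical inputs."""
    mse = mean_squared_error(reference, restored)
    if mse == 0:
        return math.inf
    return 10.0 * math.log10(peak ** 2 / mse)


# Hypothetical 8-bit pixel values for a tiny image flattened to a list.
original_pixels = [52, 55, 61, 66, 70, 61, 64, 73]
restored_pixels = [50, 57, 60, 68, 69, 63, 62, 75]
print(f"PSNR = {psnr_db(original_pixels, restored_pixels):.2f} dB")
```

A higher PSNR means a smaller pixel-wise error relative to the peak value, which is why the abstract treats higher values as better restoration.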

Video quality assessment based on Fast Structural Similarity index algorithm

2012 Fourth International Conference on Ubiquitous and Future Networks (ICUFN) ◽

10.1109/icufn.2012.6261724 ◽

2012 ◽

Cited By ~ 3

Author(s):

Liu Zhen ◽

Sun Yong ◽

Zhang Jingjing ◽

Wen Xiangming ◽

Su Tao

Keyword(s):

Quality Assessment ◽

Video Quality ◽

Similarity Index ◽

Structural Similarity ◽

Video Quality Assessment ◽

Structural Similarity Index

Spherical Structural Similarity Index for Objective Omnidirectional Video Quality Assessment

2018 IEEE International Conference on Multimedia and Expo (ICME) ◽

10.1109/icme.2018.8486584 ◽

2018 ◽

Cited By ~ 12

Author(s):

Sijia Chen ◽

Yingxue Zhang ◽

Yiming Li ◽

Zhenzhong Chen ◽

Zhou Wang

Keyword(s):

Quality Assessment ◽

Video Quality ◽

Similarity Index ◽

Structural Similarity ◽

Video Quality Assessment ◽

Structural Similarity Index
