Block Based Enhancement using Deep Learning for Conversion of Low Resolution AVS Video to High Resolution HEVC Video

10.36227/techrxiv.14887917 ◽

2021 ◽

Author(s):

Rizwan Qureshi ◽

Mehmood Nawaz

Keyword(s):

Deep Learning ◽

High Efficiency ◽

Motion Vector ◽

Super Resolution ◽

Computational Time ◽

Video Frame ◽

Low Resolution ◽

High Efficiency Video Coding ◽

High Definition ◽

Resolution Method

Conversion of one video bitstream to another video bitstream is a challenging task in the heterogeneous transcoder due to different video formats. In this paper, a region of interest (ROI) based super resolution technique is used to convert the lowresolution AVS (audio video standard) video to high definition HEVC (high efficiency video coding) video. Firstly, we classify a low-resolution video frame into small blocks by using visual characteristics, transform coefficients, and motion vector (MV) of a video. These blocks are further classified as blocks of most interest (BOMI), blocks of less interest (BOLI) and blocks of noninterest (BONI). The BONI blocks are considered as background blocks due to less interest in video and remains unchanged during SR process. Secondly, we apply deep learning based super resolution method on low resolution BOMI, and BOLI blocks to enhance the visual quality. The BOMI and BOLI blocks have high attention due to ROI that include some motion and contrast of the objects. The proposed method saves 20% to 30% computational time and obtained appreciable results as compared with full frame based super resolution method. We have tested our method on different official video sequences with resolution of 1K, 2K, and 4K. Our proposed method has an efficient visual performance in contrast to the full frame-based super resolution method.

Download Full-text

FPGA-based implementation of intra prediction module for HEVC decoder

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.f2440.078219 ◽

2019 ◽

Vol 8 (2) ◽

pp. 6130-6137

Keyword(s):

Power Consumption ◽

High Efficiency ◽

Cmos Technology ◽

Intra Prediction ◽

Hardware Architecture ◽

Video Frame ◽

High Efficiency Video Coding ◽

High Definition ◽

Prediction Module ◽

Block Sizes

The High Efficiency Video Coding (HEVC) is the new standard which is designed to support High Definition (HD) and Ultra-HD video cotenants. In HEVC, several new coding tools are adopted in order to improve the coding efficiency and compression ratio but with a significant increase in the computational complexity comparing to the previous standard H.264/AVC. In this paper, we focus on reducing the complexity of the most consuming block in the HEVC decoder standard which is the intra prediction module. In this context, we propose an optimized hardware architecture dedicated to support the 34 modes of intra prediction module considering 4×4, 8×8 and 16×16 block sizes. The proposed design exploits the symmetric property between horizontal and vertical modes. Hence, we implement a new hardware architecture that factorizes the same hardware resources for both directions which leads to save the hardware cost, the power consumption and the processing time. Furthermore, the different block sizes are implemented independently in order to avoid memory overhead while accessing to the shared memory. The implemented design using Xilinx Zynq-based FPGA platform can process in real time the Ultra-HD video frame of resolution (4096×2048) at 232 MHz. As well, the synthesis results using the TSMC 180 nm CMOS technology provide similar performance than our FPGA implementation. Finally, the HW/SW implementation of full HEVC decoder can process the decoding of 15 FPS in best case for 240p video resolution with a gain of 60% in power consumption.

Download Full-text

Fast CU Size Decision Method Based on Just Noticeable Distortion and Deep Learning

Scientific Programming ◽

10.1155/2021/3813116 ◽

2021 ◽

Vol 2021 ◽

pp. 1-10

Author(s):

Jinchao Zhao ◽

Yihan Wang ◽

Qiuwen Zhang

Keyword(s):

Deep Learning ◽

Video Coding ◽

High Efficiency ◽

Threshold Model ◽

High Efficiency Video Coding ◽

High Definition ◽

Coding Efficiency ◽

Decision Method ◽

Just Noticeable Distortion ◽

Cu Size Decision

With the development of broadband networks and high-definition displays, people have higher expectations for the quality of video images, which also brings new requirements and challenges to video coding technology. Compared with H.265/High Efficiency Video Coding (HEVC), the latest video coding standard, Versatile Video Coding (VVC), can save 50%-bit rate while maintaining the same subjective quality, but it leads to extremely high encoding complexity. To decrease the complexity, a fast coding unit (CU) size decision method based on Just Noticeable Distortion (JND) and deep learning is proposed in this paper. Specifically, the hybrid JND threshold model is first designed to distinguish smooth, normal, or complex region. Then, if CU belongs to complex area, the Ultra-Spherical SVM (US-SVM) classifiers are trained for forecasting the best splitting mode. Experimental results illustrate that the proposed method can save about 52.35% coding runtime, which can realize a trade-off between the reduction of computational burden and coding efficiency compared with the latest methods.

Download Full-text

Video Compression Standards for High Definition Video: A Comparative Study of High Efficiency Video Coding and H.264/MPEG-4 AVC

i-manager s Journal on Communication Engineering and Systems ◽

10.26634/jcs.2.4.2474 ◽

2013 ◽

Vol 2 (4) ◽

pp. 14-19

Author(s):

Geethu Raj ◽

M. Kannan

Keyword(s):

Comparative Study ◽

Video Coding ◽

Video Compression ◽

High Efficiency ◽

High Efficiency Video Coding ◽

High Definition ◽

High Definition Video

Download Full-text

A super-resolution method of combined color image with depth map based on deep learning

Proceedings of the 2020 International Conference on Cyberspace Innovation of Advanced Technologies ◽

10.1145/3444370.3444539 ◽

2020 ◽

Author(s):

Wei Zhang ◽

Chi-fu Yang ◽

Feng Jiang ◽

Xian-zhong Gao ◽

Kai Yang

Keyword(s):

Deep Learning ◽

Color Image ◽

Super Resolution ◽

Depth Map ◽

Resolution Method

Download Full-text

Robust and fast selective encryption for HEVC videos

Journal of Communications Software and Systems ◽

10.24138/jcomss.v10i4.118 ◽

2014 ◽

Vol 10 (4) ◽

pp. 221 ◽

Cited By ~ 1

Author(s):

Mokhtar Ouamri ◽

Kamel M. Faraoun

Keyword(s):

High Efficiency ◽

Operating Mode ◽

Advanced Encryption Standard ◽

Selective Encryption ◽

High Efficiency Video Coding ◽

High Definition ◽

Network Applications ◽

Encryption Schemes ◽

Visual Degradation ◽

Reliability And Robustness

Emerging High efficiency video coding (HEVC) is expected to be widely adopted in network applications for high definition devices and mobile terminals. Thus, construction of HEVC's encryption schemes that maintain format compliance and bit rate of encrypted bitstream becomes an active security's researches area. This paper presents a novel selective encryption technique for HEVC videos, based on enciphering the bins of selected Golomb–Rice code’s suffixes with the Advanced Encryption Standard (AES) in a CBC operating mode. The scheme preserves format compliance and size of the encrypted HEVC bitstream, and provides high visual degradation with optimized encryption space defined by selected Golomb–Rice suffixes. Experimental results show reliability and robustness of the proposed technique.

Download Full-text

Region Filling with Super Resolution Algorithm

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.35707 ◽

2021 ◽

Vol 9 (VI) ◽

pp. 3557-3564

Author(s):

Darakhshan R. Khan

Keyword(s):

Patch Size ◽

Research Work ◽

Super Resolution ◽

Input Image ◽

Computational Time ◽

Low Resolution ◽

Resolution Image ◽

Resolution Algorithm ◽

Region Filling

Region filling which has another name inpainting, is an approach to find the values of missing pixels from data available in the remaining portion of the image. The missing information must be recalculated in a distinctly convincing manner, such that, image look seamless. This research work has built a methodology for completely automating patch priority based region filling process. To reduce the computational time, low resolution image is constructed from input image. Based on texel of an image, patch size is determined. Several low resolution image with missing region filled is generated using region filling algorithm. Pixel information from these low resolution images is consolidated to produce single low resolution region filled image. Finally, super resolution algorithm is applied to enhance the quality of image and regain all specifics of image. This methodology of identifying patch size based on input fed has an advantage over filling algorithms which in true sense automate the process of region filling, to deal with sensitivity in region filling, algorithm different parameter settings are used and functioning with coarse version of image will notably reduce the computational time.

Download Full-text

Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC

Scientific Programming ◽

10.1155/2020/8883214 ◽

2020 ◽

Vol 2020 ◽

pp. 1-11

Author(s):

Jinchao Zhao ◽

Yihan Wang ◽

Qiuwen Zhang

Keyword(s):

Deep Learning ◽

Video Coding ◽

High Efficiency ◽

Texture Classification ◽

Rate Distortion ◽

Classification Model ◽

High Efficiency Video Coding ◽

Fast Encoding ◽

Training Samples ◽

Coding Unit

With the development of technology, the hardware requirement and expectations of user for visual enjoyment are getting higher and higher. The multitype tree (MTT) architecture is proposed by the Joint Video Experts Team (JVET). Therefore, it is necessary to determine not only coding unit (CU) depth but also its split mode in the H.266/Versatile Video Coding (H.266/VVC). Although H.266/VVC achieves significant coding performance on the basis of H.265/High Efficiency Video Coding (H.265/HEVC), it causes significantly coding complexity and increases coding time, where the most time-consuming part is traversal calculation rate-distortion (RD) of CU. To solve these problems, this paper proposes an adaptive CU split decision method based on deep learning and multifeature fusion. Firstly, we develop a texture classification model based on threshold to recognize complex and homogeneous CU. Secondly, if the complex CUs belong to edge CU, a Convolutional Neural Network (CNN) structure based on multifeature fusion is utilized to classify CU. Otherwise, an adaptive CNN structure is used to classify CUs. Finally, the division of CU is determined by the trained network and the parameters of CU. When the complex CUs are split, the above two CNN schemes can successfully process the training samples and terminate the rate-distortion optimization (RDO) calculation for some CUs. The experimental results indicate that the proposed method reduces the computational complexity and saves 39.39% encoding time, thereby achieving fast encoding in H.266/VVC.

Download Full-text

Quality-Oriented Perceptual HEVC Based on the Spatiotemporal Saliency Detection Model

Entropy ◽

10.3390/e21020165 ◽

2019 ◽

Vol 21 (2) ◽

pp. 165 ◽

Cited By ~ 5

Author(s):

Xiantao Jiang ◽

Tian Song ◽

Daqi Zhu ◽

Takafumi Katayama ◽

Lu Wang

Keyword(s):

Video Coding ◽

High Efficiency ◽

Saliency Detection ◽

Saliency Map ◽

High Efficiency Video Coding ◽

High Definition ◽

Perceptual Coding ◽

Coding Scheme ◽

Saliency Model ◽

Video Saliency

Perceptual video coding (PVC) can provide a lower bitrate with the same visual quality compared with traditional H.265/high efficiency video coding (HEVC). In this work, a novel H.265/HEVC-compliant PVC framework is proposed based on the video saliency model. Firstly, both an effective and efficient spatiotemporal saliency model is used to generate a video saliency map. Secondly, a perceptual coding scheme is developed based on the saliency map. A saliency-based quantization control algorithm is proposed to reduce the bitrate. Finally, the simulation results demonstrate that the proposed perceptual coding scheme shows its superiority in objective and subjective tests, achieving up to a 9.46% bitrate reduction with negligible subjective and objective quality loss. The advantage of the proposed method is the high quality adapted for a high-definition video application.

Download Full-text

Transcoding method from H.264/AVC to high efficiency video coding based on similarity of intraprediction, interprediction, and motion vector

Journal of Electronic Imaging ◽

10.1117/1.jei.25.5.053028 ◽

2016 ◽

Vol 25 (5) ◽

pp. 053028

Author(s):

Mei-Feng Liu ◽

Guo-Yun Zhong ◽

Xiao-Hai He ◽

Lin-Bo Qing

Keyword(s):

Video Coding ◽

High Efficiency ◽

Motion Vector ◽

High Efficiency Video Coding

Download Full-text