On SPARC LEON-2 ISA Extensions Experiments for MPEG Encoding Acceleration

This paper presents the necessary steps to modify the implementation of the SPARCV8 architecture to enhance it with multimedia-oriented instructions. The purpose is improving video compression performance without designing dedicated coprocessors. We investigate the complexity of modifying a standard processor instruction set and show that, although not trivial, this is feasible in a few weeks. We implemented 12 new instructions and use some of them to optimize the computation of a demanding step of the MPEG encoding. The result is a performance increase of 67% in the execution of a part of this algorithm, allowing us to expect a 30% speedup in the execution of an MPEG video compression. The area increase of the integer unit is about 18% and the clock frequency is not significantly modified in an LEON-2 implementing 6 among 12 of the new instructions.

Download Full-text

Per Clip Lagrangian Multiplier Optimisation for HEVC

Electronic Imaging ◽

10.2352/issn.2470-1173.2020.10.ipas-136 ◽

2020 ◽

Vol 2020 (10) ◽

pp. 136-1-136-7

Author(s):

Daniel J Ringis ◽

François Pitié ◽

Anil Kokaram

Keyword(s):

Video Compression ◽

Rate Distortion ◽

Lagrangian Multiplier ◽

Video Content ◽

Compression Performance ◽

Quality Video ◽

Rate Improvement ◽

Optimisation Methods ◽

High Quality Video ◽

The Impact

The majority of internet traffic is video content. This drives the demand for video compression in order to deliver high quality video at low target bitrates. This paper investigates the impact of adjusting the rate distortion equation on compression performance. An constant of proportionality, k, is used to modify the Lagrange multiplier used in H.265 (HEVC). Direct optimisation methods are deployed to maximise BD-Rate improvement for a particular clip. This leads to up to 21% BD-Rate improvement for an individual clip. Furthermore we use a more realistic corpus of material provided by YouTube. The results show that direct optimisation using BD-rate as the objective function can lead to further gains in bitrate savings that are not available with previous approaches.

Download Full-text

NISC-based MIMO MMSE Detector

Journal of Circuits System and Computers ◽

10.1142/s0218126621500699 ◽

2020 ◽

pp. 2150069

Author(s):

Mostafa Rizk ◽

Amer Baghdadi ◽

Michel Jézéquel ◽

Youssef Atat ◽

Yasser Mohanna

Keyword(s):

Dynamic Scheduling ◽

Mean Squared Error ◽

Mimo Systems ◽

Architecture Design ◽

Instruction Set ◽

Clock Frequency ◽

Maximum Throughput ◽

Performance Requirements ◽

Minimum Mean Squared Error ◽

Mmse Detector

Several application-specific processor design approaches have been proposed and investigated to cope with the emerging flexibility requirements jointly associated with the maximum performance efficiency and minimum implementation area and power consumption. Dynamic scheduling of a set of instructions generally leads to an overhead related to instruction decoding. To mitigate this overhead, other approaches have been proposed using static scheduling of datapath control signals. In this context, No-Instruction-Set-Computer (NISC) concept have been introduced considering that a dedicated processor to a specific application does not need an instruction set especially when it is programmed by its designers and not by its users. In this paper, the hardware architecture design of flexible NISC-based architecture design dedicated for minimum mean-squared error (MMSE) linear detection is presented. The devised design, which is used in iterative turbo-receiver, fulfills the performance requirements of emergent wireless communication standards with throughput reaching that of LTE-Advanced. FPGA hardware implementation of the detector architecture achieves a maximum throughput of 115.8 Mega symbols per second for [Formula: see text] and 6.4 Mega symbols per second for [Formula: see text] MIMO systems for an operating clock frequency of 202.67[Formula: see text]MHz.

Download Full-text

COMPARISON OF OPEN AND FREE VIDEO COMPRESSION SYSTEMS - A Performance Evaluation

Proceedings of the First International Conference on Computer Imaging Theory and Applications ◽

10.5220/0001809700740080 ◽

2009 ◽

Keyword(s):

Performance Evaluation ◽

Video Compression ◽

A Performance

Download Full-text

A Complexity and Quality Evaluation of Block Based Motion Estimation Algorithms

Acta Polytechnica ◽

10.14311/668 ◽

2005 ◽

Vol 45 (1) ◽

Author(s):

S. Usama ◽

M. Montaser ◽

O. Ahmed

Keyword(s):

Performance Evaluation ◽

Motion Estimation ◽

Video Compression ◽

Quality Evaluation ◽

Point Of View ◽

Video Sequences ◽

Compression Algorithms ◽

Estimation Algorithms ◽

Block Based ◽

A Performance

Motion estimation is a method, by which temporal redundancies are reduced, which is an important aspect of video compression algorithms. In this paper we present a comparison among some of the well-known block based motion estimation algorithms. A performance evaluation of these algorithms is proposed to decide the best algorithm from the point of view of complexity and quality for noise-free video sequences and also for noisy video sequences.

Download Full-text

Depth Map Video Compression Performance Evaluation For Ieee 1857.9

2021 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) ◽

10.1109/icmew53276.2021.9455977 ◽

2021 ◽

Author(s):

Yangang Cai ◽

Ronggang Wang ◽

Ke Qiu ◽

Rui Peng ◽

Zhipeng Cheng ◽

...

Keyword(s):

Performance Evaluation ◽

Video Compression ◽

Depth Map ◽

Compression Performance

Download Full-text

Comparison of 3D 360-Degree Video Compression Performance Using Different Projections

2019 IEEE Canadian Conference of Electrical and Computer Engineering (CCECE) ◽

10.1109/ccece.2019.8861868 ◽

2019 ◽

Cited By ~ 2

Author(s):

Mohammadreza Jamali ◽

Firouzeh Golaghazadeh ◽

Stephane Coulombe ◽

Ahmad Vakili ◽

Carlos Vazquez

Keyword(s):

Video Compression ◽

Compression Performance

Download Full-text

Insights on Video Compression Strategies using Machine Learning

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.a9756.109119 ◽

2019 ◽

Vol 9 (1) ◽

pp. 2724-2733

Keyword(s):

Machine Learning ◽

Video Compression ◽

Research Problem ◽

Learning Approach ◽

Unsolved Research Problem ◽

Compression Technique ◽

Compression Performance ◽

Machine Learning Approach ◽

Research Problems ◽

Research Gap

With the rising advancement of the multimedia technology, video compression is becoming a challenging problem. Although, there is availability of various standard compression algorithms, yet robust compression performance is yet to be seen in existing compression techniques. This paper also highlights that machine learning plays a significant contributory role in improving the performance of the video compression. Therefore, this manuscript offers a technical insight about the performance of existing video compression technique using machine learning approach. The contribution of this paper is its findings which states that machine learning approach do have significant advantage but the advantageous features are limited by the inherent and unsolved research problem. The core findings of this paper are basically to highlight the strength and limitations of existing methods as well as to highlight the research gap in terms of open-end research problems which requires immediate attention.

Download Full-text

High-speed Instruction-set Coprocessor for Lattice-based Key Encapsulation Mechanism: Saber in Hardware

IACR Transactions on Cryptographic Hardware and Embedded Systems ◽

10.46586/tches.v2020.i4.443-466 ◽

2020 ◽

pp. 443-466

Author(s):

Sujoy Sinha Roy ◽

Andrea Basso

Keyword(s):

High Speed ◽

Critical Role ◽

Computation Time ◽

Public Key Cryptography ◽

Instruction Set ◽

Clock Frequency ◽

Polynomial Multiplication ◽

Trade Offs ◽

Key Encapsulation Mechanism ◽

Scale Design

In this paper, we present an instruction set coprocessor architecture for lattice-based cryptography and implement the module lattice-based post-quantum key encapsulation mechanism (KEM) Saber as a case study. To achieve fast computation time, the architecture is fully implemented in hardware, including CCA transformations. Since polynomial multiplication plays a performance-critical role in the module and ideal lattice-based public-key cryptography, a parallel polynomial multiplier architecture is proposed that overcomes memory access bottlenecks and results in a highly parallel yet simple and easy-to-scale design. Such multipliers can compute a full multiplication in 256 cycles, but are designed to target any area/performance trade-offs. Besides optimizing polynomial multiplication, we make important design decisions and perform architectural optimizations to reduce the overall cycle counts as well as improve resource utilization. For the module dimension 3 (security comparable to AES-192), the coprocessor computes CCA key generation, encapsulation, and decapsulation in only 5,453, 6,618 and 8,034 cycles respectively, making it the fastest hardware implementation of Saber to our knowledge. On a Xilinx UltraScale+ XCZU9EG-2FFVB1156 FPGA, the entire instruction set coprocessor architecture runs at 250 MHz clock frequency and consumes 23,686 LUTs, 9,805 FFs, and 2 BRAM tiles (including 5,113 LUTs and 3,068 FFs for the Keccak core).

Download Full-text

Video Compression for Surveillance Application using Deep Neural Network

Journal of Artificial Intelligence and Capsule Networks - September 2019 ◽

10.36548/jaicn.2020.2.006 ◽

2020 ◽

Vol 2 (2) ◽

pp. 131-145

Author(s):

Prasanga Dhungel ◽

Prashant Tandan ◽

Sandesh Bhusal ◽

Sobit Neupane ◽

Subarna Shakya

Keyword(s):

Neural Network ◽

Video Compression ◽

Rate Distortion ◽

General Purpose ◽

New Approach ◽

Video Frames ◽

Spatio Temporal ◽

Temporal Redundancy ◽

Mpeg Encoding ◽

Surveillance Application

We present a new approach to video compression for video surveillance by refining the shortcomings of conventional approach and substitute each traditional component with their neural network counterpart. Our proposed work consists of motion estimation, compression and compensation and residue compression, learned end-to-end to minimize the rate-distortion trade off. The whole model is jointly optimized using a single loss function. Our work is based on a standard method to exploit the spatio-temporal redundancy in video frames to reduce the bit rate along with the minimization of distortions in decoded frames. We implement a neural network version of conventional video compression approach and encode the redundant frames with lower number of bits. Although, our approach is more concerned toward surveillance, it can be extended easily to general purpose videos too. Experiments show that our technique is efficient and outperforms standard MPEG encoding at comparable bitrates while preserving the visual quality.

Download Full-text

Informal subjective quality comparison of video compression performance of the HEVC and H.264/MPEG-4 AVC standards for low-delay applications

10.1117/12.953235 ◽

2012 ◽

Cited By ~ 11

Author(s):

Michael Horowitz ◽

Faouzi Kossentini ◽

Nader Mahdi ◽

Shilin Xu ◽

Hsan Guermazi ◽

...

Keyword(s):

Video Compression ◽

Subjective Quality ◽

Compression Performance ◽

Low Delay

Download Full-text