polynomial multiplication Latest Research Papers

High-degree, low-precision polynomial arithmetic is a fundamental computational primitive underlying structured lattice-based cryptography. Its algorithmic properties and suitability for implementation on different compute platforms are an active area of research, and this article contributes to this line of work. First, we present memory-efficiency and performance improvements for the Toom-Cook/Karatsuba polynomial multiplication strategy. Second, we provide implementations of those improvements on the Arm® Cortex®-M4 CPU, as well as on the newer Cortex-M55 processor, the first M-profile core implementing the M-profile Vector Extension (MVE), also known as Arm® Helium™ technology. We also implement the Number Theoretic Transform (NTT) on the Cortex-M55 processor. We show that, despite the core being single-issue and in-order and offering only 8 vector registers compared to 32 on A-profile SIMD architectures like Arm® Neon™ technology and the Scalable Vector Extension (SVE), careful register management and instruction scheduling yield a 3× to 5× performance improvement over already highly optimized implementations on Cortex-M4, while maintaining the low area and energy profile necessary for use in the embedded market. Finally, as a real-world application, we integrate our multiplication techniques into the post-quantum key-encapsulation mechanism Saber.
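As a rough illustration of the Karatsuba step named in the abstract above, the following Python sketch multiplies two coefficient lists with three half-size products instead of four. It is not the paper's Cortex-M code: the function names, the `threshold` parameter, and the restriction to equal-length inputs whose length is a power of two are assumptions made for this sketch. The helper `reduce_negacyclic` assumes a ring of the form Z_q[x]/(x^n + 1), which is the shape of ring Saber uses.

```python
def schoolbook(a, b):
    """Reference O(n^2) product of two coefficient lists."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out


def karatsuba(a, b, threshold=8):
    """Multiply equal-length coefficient lists (length assumed a power of two)."""
    n = len(a)
    if n <= threshold:
        return schoolbook(a, b)
    h = n // 2
    a0, a1 = a[:h], a[h:]
    b0, b1 = b[:h], b[h:]
    low = karatsuba(a0, b0, threshold)    # a0 * b0
    high = karatsuba(a1, b1, threshold)   # a1 * b1
    mid = karatsuba([x + y for x, y in zip(a0, a1)],
                    [x + y for x, y in zip(b0, b1)], threshold)
    # (a0 + a1)(b0 + b1) - a0*b0 - a1*b1 = a0*b1 + a1*b0
    mid = [m - lo - hi for m, lo, hi in zip(mid, low, high)]
    out = [0] * (2 * n - 1)
    for i, v in enumerate(low):
        out[i] += v
    for i, v in enumerate(mid):
        out[i + h] += v
    for i, v in enumerate(high):
        out[i + 2 * h] += v
    return out


def reduce_negacyclic(c, n, q):
    """Reduce a product of length at most 2n-1 modulo x^n + 1 and modulo q."""
    out = c[:n]
    for i in range(n, len(c)):
        out[i - n] -= c[i]   # x^n is congruent to -1
    return [v % q for v in out]
```

A call such as `reduce_negacyclic(karatsuba(a, b), len(a), q)` would then give the product in the quotient ring; the choice of `threshold` only affects where recursion hands over to the schoolbook method.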

Neon NTT: Faster Dilithium, Kyber, and Saber on Cortex-A72 and Apple M1

IACR Transactions on Cryptographic Hardware and Embedded Systems ◽

10.46586/tches.v2022.i1.221-244 ◽

2021 ◽

pp. 221-244

Author(s):

Hanno Becker ◽

Vincent Hwang ◽

Matthias J. Kannwischer ◽

Bo-Yin Yang ◽

Shang-Yi Yang

Keyword(s):

State Of The Art ◽

Polynomial Multiplication ◽

Montgomery Multiplication ◽

The Core ◽

Multi Stage ◽

Unknown Factor ◽

The Matrix ◽

Improved Technique ◽

Matrix Vector ◽

Vector Polynomial

We present new speed records on the Armv8-A architecture for the lattice-based schemes Dilithium, Kyber, and Saber. The core novelty in this paper is the combination of Montgomery multiplication and Barrett reduction, resulting in “Barrett multiplication”, which allows particularly efficient modular one-known-factor multiplication using the Armv8-A Neon vector instructions. These novel techniques, combined with fast two-unknown-factor Montgomery multiplication, Barrett reduction sequences, and interleaved multi-stage butterflies, result in significantly faster code. We also introduce “asymmetric multiplication”, an improved technique for caching the results of the incomplete NTT, used e.g. for matrix-to-vector polynomial multiplication. Our implementations target the Arm Cortex-A72 CPU, on which our speed is 1.7× that of the state-of-the-art matrix-to-vector polynomial multiplication in kyber768 [Nguyen–Gaj 2021]. For Saber, NTTs are far superior to Toom–Cook multiplication on the Armv8-A architecture, outrunning the matrix-to-vector polynomial multiplication by 2.0×. On the Apple M1, our matrix-vector products run 2.1× and 1.9× faster for Kyber and Saber respectively.
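The idea of one-known-factor multiplication can be sketched in scalar Python: when one factor `b` is fixed in advance (for example a twiddle factor), a scaled quotient estimate can be precomputed so that the modular product needs only multiplications, a shift, and a subtraction. The sketch below is a generic Barrett-style construction written for this listing, not the paper's Neon instruction sequence; the function names, the word size `w = 16`, and the rounding convention are assumptions.

```python
def barrett_precompute(b, q, w=16):
    """Precompute round(b * 2^w / q) for a known factor b (assumes 0 <= b < q)."""
    R = 1 << w
    return (b * R + q // 2) // q


def barrett_mul(a, b, b_prime, q, w=16):
    """Return a small representative of a*b mod q, given b_prime for b.

    The quotient estimate t is approximately a*b/q, so a*b - t*q is
    congruent to a*b modulo q and, for |a| < 2^w, smaller than q in
    absolute value.
    """
    R = 1 << w
    t = (a * b_prime + R // 2) >> w   # round(a * b_prime / 2^w)
    return a * b - t * q
```

With Kyber's modulus `q = 3329`, a call such as `barrett_mul(a, 17, barrett_precompute(17, 3329), 3329)` returns a signed value congruent to `17 * a`; a final conditional correction would be needed if a canonical result in `[0, q)` is required.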

Racing BIKE: Improved Polynomial Multiplication and Inversion in Hardware

IACR Transactions on Cryptographic Hardware and Embedded Systems ◽

10.46586/tches.v2022.i1.557-588 ◽

2021 ◽

pp. 557-588

Author(s):

Jan Richter-Brockmann ◽

Ming-Shing Chen ◽

Santosh Ghosh ◽

Tim Güneysu

Keyword(s):

High Speed ◽

Optimized Design ◽

Key Generation ◽

Shared Resources ◽

Polynomial Multiplication ◽

Sparse Polynomial ◽

Sparse Polynomials ◽

Key Encapsulation Mechanism ◽

Standardization Process ◽

High Speed Design

BIKE is a Key Encapsulation Mechanism selected as an alternate candidate in NIST’s PQC standardization process, in which performance plays a significant role in the third round. This paper presents FPGA implementations of BIKE with the best area-time performance reported in the literature. We optimize two key arithmetic operations: the sparse polynomial multiplication and the polynomial inversion. Our sparse multiplier achieves time-constancy for sparse polynomials of indefinite Hamming weight used in BIKE’s encapsulation. The polynomial inversion is based on the extended Euclidean algorithm, which is unprecedented in current BIKE implementations. Our optimized design results in a 5.5 times faster key generation compared to previous implementations based on Fermat’s little theorem. Besides the arithmetic optimizations, we present a united hardware design of BIKE with shared resources and shared sub-modules among KEM functionalities. On Xilinx Artix-7 FPGAs, our light-weight implementation consumes only 3 777 slices and performs a key generation, encapsulation, and decapsulation in 3 797 μs, 443 μs, and 6 896 μs, respectively. Our high-speed design requires 7 332 slices and performs the three KEM operations in 1 672 μs, 132 μs, and 1 892 μs, respectively.
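To make the sparse multiplication concrete: in a ring of the form GF(2)[x]/(x^r - 1), multiplying a dense polynomial by a sparse one amounts to XOR-ing together cyclic rotations of the dense operand, one rotation per nonzero coefficient of the sparse operand. The Python sketch below illustrates only that arithmetic. It is a software toy, not the paper's hardware design, and it is not constant-time; the names `rotl` and `sparse_mul` and the encoding of polynomials as Python integers (bit i holds the coefficient of x^i) are assumptions of this sketch.

```python
def rotl(v, s, r):
    """Cyclically rotate the r-bit value v left by s positions (multiply by x^s)."""
    s %= r
    mask = (1 << r) - 1
    return ((v << s) | (v >> (r - s))) & mask


def sparse_mul(dense, support, r):
    """Multiply in GF(2)[x]/(x^r - 1).

    dense   : integer bit vector of the dense polynomial (must be < 2^r)
    support : exponents with coefficient 1 in the sparse polynomial
    """
    acc = 0
    for pos in support:
        acc ^= rotl(dense, pos, r)   # add dense * x^pos over GF(2)
    return acc
```

The running time of this loop depends on the length of `support`, which is exactly the kind of weight-dependent behavior a constant-time multiplier has to avoid.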

A Polynomial Multiplication Accelerator for Homomorphic Encryption using DGT

10.1109/asid52932.2021.9651679 ◽

2021 ◽

Author(s):

Jigang Yang ◽

Zhenmin Li ◽

Jingwei Ren ◽

Xiaolei Wang ◽

Wei Ni ◽

...

Keyword(s):

Homomorphic Encryption ◽

Polynomial Multiplication

CROP: FPGA Implementation of High-Performance Polynomial Multiplication in Saber KEM based on Novel Cyclic-Row Oriented Processing Strategy

10.1109/iccd53106.2021.00031 ◽

2021 ◽

Author(s):

Jiafeng Xie ◽

Pengzhou He ◽

Chiou-Yng Lee

Keyword(s):

High Performance ◽

Fpga Implementation ◽

Processing Strategy ◽

Polynomial Multiplication

Automatic Library Generation and Performance Tuning for Modular Polynomial Multiplication

10.17918/etd-6611 ◽

2021 ◽

Author(s):

Lingchuan Meng

Keyword(s):

Performance Tuning ◽

Polynomial Multiplication ◽

And Performance ◽

Modular Polynomial ◽

Automatic Library Generation

Blockchain-Based Secure Outsourcing of Polynomial Multiplication and Its Application in Fully Homomorphic Encryption

Security and Communication Networks ◽

10.1155/2021/9962575 ◽

2021 ◽

Vol 2021 ◽

pp. 1-14

Author(s):

Mingyang Song ◽

Yingpeng Sang ◽

Yuying Zeng ◽

Shunchao Luo

Keyword(s):

Homomorphic Encryption ◽

Security Analysis ◽

Secure Computation ◽

Modular Exponentiation ◽

Local Computation ◽

Fully Homomorphic Encryption ◽

Polynomial Multiplication ◽

Secure Outsourcing ◽

Cheating Detection ◽

Constrained Devices

The efficiency of fully homomorphic encryption has always affected its practicality. With the dawn of the Internet of Things, the demand for computation and encryption on resource-constrained devices is increasing. Complex cryptographic computing is a major burden for those devices, while outsourcing can provide great convenience for them. In this paper, we first propose a generic blockchain-based framework for secure computation outsourcing and then propose an algorithm for secure outsourcing of polynomial multiplication into the blockchain. Our algorithm for polynomial multiplication can reduce the local computation cost to O(n). Previous work based on the Fast Fourier Transform can only achieve O(n log n) for the local cost. Finally, we integrate the two secure outsourcing schemes for polynomial multiplication and modular exponentiation into fully homomorphic encryption using hidden ideal lattices and obtain an outsourcing scheme for fully homomorphic encryption. Through security analysis, we show that our schemes achieve the goals of privacy protection against passive attackers and cheating detection against active attackers. Experiments also demonstrate that our schemes are more efficient in comparison with the corresponding non-outsourcing schemes.

Reduced-Complexity Modular Polynomial Multiplication for R-LWE Cryptosystems

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414005 ◽

2021 ◽

Author(s):

Xinmiao Zhang ◽

Keshab K. Parhi

Keyword(s):

Polynomial Multiplication ◽

Reduced Complexity ◽

Modular Polynomial

High-Speed NTT-based Polynomial Multiplication Accelerator for Post-Quantum Cryptography

10.1109/arith51176.2021.00028 ◽

2021 ◽

Author(s):

Mojtaba Bisheh-Niasar ◽

Reza Azarderakhsh ◽

Mehran Mozaffari-Kermani

Keyword(s):

Quantum Cryptography ◽

High Speed ◽

Polynomial Multiplication ◽

Post Quantum Cryptography

polynomial multiplication
Recently Published Documents

Faster characteristic three polynomial multiplication and its application to NTRU Prime decapsulation

Polynomial multiplication on embedded vector architectures

Neon NTT: Faster Dilithium, Kyber, and Saber on Cortex-A72 and Apple M1

Racing BIKE: Improved Polynomial Multiplication and Inversion in Hardware

A Polynomial Multiplication Accelerator for Homomorphic Encryption using DGT

CROP: FPGA Implementation of High-Performance Polynomial Multiplication in Saber KEM based on Novel Cyclic-Row Oriented Processing Strategy

Automatic Library Generation and Performance Tuning for Modular Polynomial Multiplication

Blockchain-Based Secure Outsourcing of Polynomial Multiplication and Its Application in Fully Homomorphic Encryption

Reduced-Complexity Modular Polynomial Multiplication for R-LWE Cryptosystems

High-Speed NTT-based Polynomial Multiplication Accelerator for Post-Quantum Cryptography
