Pushing the Limits of High-Speed GF(2 m ) Elliptic Curve Scalar Multiplication on FPGAs

The higher computational complexity of an elliptic curve scalar point multiplication operation limits its implementation on general purpose processors. Dedicated hardware architectures are essential to reduce the computational time, which results in a substantial increase in the performance of associated cryptographic protocols. This paper presents a unified architecture to compute modular addition, subtraction, and multiplication operations over a finite field of large prime characteristicGF(p). Subsequently, dual instances of the unified architecture are utilized in the design of high speed elliptic curve scalar multiplier architecture. The proposed architecture is synthesized and implemented on several different Xilinx FPGA platforms for different field sizes. The proposed design computes a 192-bit elliptic curve scalar multiplication in 2.3 ms on Virtex-4 FPGA platform. It is 34%faster and requires 40%fewer clock cycles for elliptic curve scalar multiplication and consumes considerable fewer FPGA slices as compared to the other existing designs. The proposed design is also resistant to the timing and simple power analysis (SPA) attacks; therefore it is a good choice in the construction of fast and secure elliptic curve based cryptographic protocols.

Download Full-text

High-Speed Implementation of Elliptic Curve Scalar Multiplication on FPGAs

Hardware Security ◽

10.1201/b17555-9 ◽

2014 ◽

pp. 153-175

Keyword(s):

Elliptic Curve ◽

High Speed ◽

Scalar Multiplication ◽

Elliptic Curve Scalar Multiplication

Download Full-text

Implementation of an Elliptic Curve Scalar Multiplication Method Using Division Polynomials

IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences ◽

10.1587/transfun.e97.a.300 ◽

2014 ◽

Vol E97.A (1) ◽

pp. 300-302 ◽

Cited By ~ 3

Author(s):

Naoki KANAYAMA ◽

Yang LIU ◽

Eiji OKAMOTO ◽

Kazutaka SAITO ◽

Tadanori TERUYA ◽

...

Keyword(s):

Elliptic Curve ◽

Scalar Multiplication ◽

Division Polynomials ◽

Elliptic Curve Scalar Multiplication

Download Full-text

High-Performance Pipelined Architecture of Elliptic Curve Scalar Multiplication Over GF(2m)

IEEE Transactions on Very Large Scale Integration (VLSI) Systems ◽

10.1109/tvlsi.2015.2453360 ◽

2016 ◽

Vol 24 (4) ◽

pp. 1223-1232 ◽

Cited By ~ 20

Author(s):

Lijuan Li ◽

Shuguo Li

Keyword(s):

Elliptic Curve ◽

High Performance ◽

Scalar Multiplication ◽

Pipelined Architecture ◽

Elliptic Curve Scalar Multiplication

Download Full-text

Ternary Method in Elliptic Curve Scalar Multiplication

2013 5th International Conference on Intelligent Networking and Collaborative Systems ◽

10.1109/incos.2013.93 ◽

2013 ◽

Cited By ~ 1

Author(s):

Ning Zhang ◽

Xiaotong Fu

Keyword(s):

Elliptic Curve ◽

Scalar Multiplication ◽

Elliptic Curve Scalar Multiplication

Download Full-text

MicroBlaze-Based Multiprocessor Embedded Cryptosystem on FPGA for Elliptic Curve Scalar Multiplication Over Fp

Journal of Circuits System and Computers ◽

10.1142/s0218126619500373 ◽

2019 ◽

Vol 28 (03) ◽

pp. 1950037 ◽

Cited By ~ 3

Author(s):

A. Bellemou ◽

N. Benblidia ◽

M. Anane ◽

M. Issad

Keyword(s):

Elliptic Curve ◽

Scalar Multiplication ◽

Security Level ◽

Prime Field ◽

Montgomery Modular Multiplication ◽

Parallel Implementations ◽

High Radix ◽

System On Programmable Chip ◽

Xilinx Fpga ◽

Elliptic Curve Scalar Multiplication

In this paper, we present Microblaze-based parallel architectures of Elliptic Curve Scalar Multiplication (ECSM) computation for embedded Elliptic Curve Cryptosystem (ECC) on Xilinx FPGA. The proposed implementations support arbitrary Elliptic Curve (EC) forms defined over large prime field ([Formula: see text]) with different security-level sizes. ECSM is performed using Montgomery Power Ladder (MPL) algorithm in Chudnovsky projective coordinates system. At the low abstraction level, Montgomery Modular Multiplication (MMM) is considered as the critical operation. It is implemented within a hardware Accelerator MMM (AccMMM) core based on the modified high radix, [Formula: see text] MMM algorithm. The efficiency of our parallel implementations is achieved by the combination of the mixed SW/HW approach with Multi Processor System on Programmable Chip (MPSoPC) design. The integration of multi MicroBlaze processor in single architecture allows not only the flexibility of the overall system but also the exploitation of the parallelism in ECSM computation with several degrees. The Virtex-5 parallel implementations of 256-bit and 521-bis ECSM computations run at 100[Formula: see text]MHZ frequency and consume between 2,739 and 6,533 slices, 22 and 72 RAMs and between 16 and 48 DSP48E cores. For the considered security-level sizes, the delays to perform single ECSM are between 115[Formula: see text]ms and 14.72[Formula: see text]ms.

Download Full-text