High Performance Reconfigurable Elliptic Curve Cipher Processor Implementation

Abstract Elliptic Curve Encryption (ECC) has been widely used in the field of digital signatures in communication security. ECC standards and the diversification of application scenarios put forward higher requirements for the flexibility of ECC processors. Therefore, it is necessary to design a flexible and reconfigurable processor to adapt to changing standards. The cryptographic processor chip designed in this paper supports the choice of prime and binary fields, supports the maximum key length of 576 bits, uses microcode programming to achieve reconfigurable function, and significantly improves the flexibility of the dedicated cryptographic processor. At the same time, the speed of modular multiplication and modular division can be greatly improved under the condition of keeping the low level of hardware resources through a carefully designed modular unit of operation. After using FPGA for hardware implementation, it is configured into a 256-bit key length. The highest clock frequency of this design can reach 55.7MHz, occupying 12425LUTS. Compared with a similar design, the performance is also greatly improved. After MALU module optimization design, modular multiplication module division also has significant advantages in computing time consumption.

Download Full-text

A High-Speed Elliptic Curve Cryptography Processor for Teleoperated Systems Security

Mathematical Problems in Engineering ◽

10.1155/2021/6633925 ◽

2021 ◽

Vol 2021 ◽

pp. 1-8

Author(s):

Yong Xiao ◽

Weibin Lin ◽

Yun Zhao ◽

Chao Cui ◽

Ziwen Cai

Keyword(s):

Elliptic Curve ◽

Elliptic Curve Cryptography ◽

High Speed ◽

High Performance ◽

Prime Field ◽

Practical Applications ◽

Cell Library ◽

Cryptographic Algorithm ◽

Human Operators ◽

Key Length

Teleoperated robotic systems are those in which human operators control remote robots through a communication network. The deployment and integration of teleoperated robot’s systems in the medical operation have been hampered by many issues, such as safety concerns. Elliptic curve cryptography (ECC), an asymmetric cryptographic algorithm, is widely applied to practical applications because its far significantly reduced key length has the same level of security as RSA. The efficiency of ECC on GF (p) is dictated by two critical factors, namely, modular multiplication (MM) and point multiplication (PM) scheduling. In this paper, the high-performance ECC architecture of SM2 is presented. MM is composed of multiplication and modular reduction (MR) in the prime field. A two-stage modular reduction (TSMR) algorithm in the SCA-256 prime field is introduced to achieve low latency, which avoids more iterative subtraction operations than traditional algorithms. To cut down the run time, a schedule is put forward when exploiting the parallelism of multiplication and MR inside PM. Synthesized with a 0.13 um CMOS standard cell library, the proposed processor consumes 341.98k gate areas, and each PM takes 0.092 ms.

Download Full-text

A Hardware-Accelerated ECDLP with High-Performance Modular Multiplication

International Journal of Reconfigurable Computing ◽

10.1155/2012/439021 ◽

2012 ◽

Vol 2012 ◽

pp. 1-14 ◽

Cited By ~ 4

Author(s):

Lyndon Judge ◽

Suvarna Mane ◽

Patrick Schaumont

Keyword(s):

Elliptic Curve ◽

Elliptic Curve Cryptography ◽

High Performance ◽

Design Space ◽

Discrete Logarithm ◽

Public Key Cryptography ◽

Modular Multiplication ◽

Polynomial Representation ◽

Prime Field ◽

Modular Multiplier

Elliptic curve cryptography (ECC) has become a popular public key cryptography standard. The security of ECC is due to the difficulty of solving the elliptic curve discrete logarithm problem (ECDLP). In this paper, we demonstrate a successful attack on ECC over prime field using the Pollard rho algorithm implemented on a hardware-software cointegrated platform. We propose a high-performance architecture for multiplication over prime field using specialized DSP blocks in the FPGA. We characterize this architecture by exploring the design space to determine the optimal integer basis for polynomial representation and we demonstrate an efficient mapping of this design to multiple standard prime field elliptic curves. We use the resulting modular multiplier to demonstrate low-latency multiplications for curves secp112r1 and P-192. We apply our modular multiplier to implement a complete attack on secp112r1 using a Nallatech FSB-Compute platform with Virtex-5 FPGA. The measured performance of the resulting design is 114 cycles per Pollard rho step at 100 MHz, which gives 878 K iterations per second per ECC core. We extend this design to a multicore ECDLP implementation that achieves 14.05 M iterations per second with 16 parallel point addition cores.

Download Full-text

High Performance FPGA Implementation of Elliptic Curve Cryptography over Binary Fields

2014 IEEE 13th International Conference on Trust, Security and Privacy in Computing and Communications ◽

10.1109/trustcom.2014.23 ◽

2014 ◽

Cited By ~ 13

Author(s):

Shuai Liu ◽

Lei Ju ◽

Xiaojun Cai ◽

Zhiping Jia ◽

Zhiyong Zhang

Keyword(s):

Elliptic Curve ◽

Elliptic Curve Cryptography ◽

High Performance ◽

Fpga Implementation ◽

Binary Fields

Download Full-text

High-Performance Elliptic Curve Cryptography by Using the CIOS Method for Modular Multiplication

Lecture Notes in Computer Science - Risks and Security of Internet and Systems ◽

10.1007/978-3-319-54876-0_15 ◽

2017 ◽

pp. 185-198 ◽

Cited By ~ 2

Author(s):

Amine Mrabet ◽

Nadia El-Mrabet ◽

Ronan Lashermes ◽

Jean-Baptiste Rigaud ◽

Belgacem Bouallegue ◽

...

Keyword(s):

Elliptic Curve ◽

Elliptic Curve Cryptography ◽

High Performance ◽

Modular Multiplication

Download Full-text

Erratum: A High Performance FPGA Implementation of 256-bit Elliptic Curve Cryptography Processor Over GF(p) [IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences Vol. E98.A (2015) , No. 3 pp.863-869]

IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences ◽

10.1587/transfun.e98.a.1057_e1 ◽

2015 ◽

Vol E98.A (4) ◽

pp. 1057_e1-1057_e1

Author(s):

Xiang FENG ◽

Shuguo LI

Keyword(s):

Elliptic Curve ◽

Elliptic Curve Cryptography ◽

High Performance ◽

Fpga Implementation

Download Full-text

Low Power Wide Fan-in Domino OR Gate Using CN-MOSFETs

International Journal of Sensors Wireless Communications and Control ◽

10.2174/2210327909666190207163639 ◽

2020 ◽

Vol 10 (1) ◽

pp. 55-62

Author(s):

Deepika Bansal ◽

Bal Chand Nagar ◽

Brahamdeo Prasad Singh ◽

Ajay Kumar

Keyword(s):

Power Consumption ◽

High Performance ◽

Dynamic Logic ◽

Clock Frequency ◽

Charge Sharing ◽

Benchmark Circuit ◽

Domino Circuit ◽

Power Delay Product ◽

Domino Circuits ◽

Or Gate

Background & Objective: In this paper, a modified pseudo domino configuration has been proposed to improve the leakage power consumption and Power Delay Product (PDP) of dynamic logic using Carbon Nanotube MOSFETs (CN-MOSFETs). The simulations for proposed and published domino circuits are verified by using Synopsys HSPICE simulator with 32nm CN-MOSFET technology which is provided by Stanford. Methods: The simulation results of the proposed technique are validated for improvement of wide fan-in domino OR gate as a benchmark circuit at 500 MHz clock frequency. Results: The proposed configuration is suitable for cascading of the high performance wide fan-in circuits without any charge sharing. Conclusion: The performance analysis of 8-input OR gate demonstrate that the proposed circuit provides lower static and dynamic power consumption up to 62 and 40% respectively, and PDP improvement is 60% as compared to standard domino circuit.

Download Full-text

Highly Area-Efficient Implementation of Modular Multiplication for Elliptic Curve Cryptography

2020 IEEE Region 10 Symposium (TENSYMP) ◽

10.1109/tensymp50017.2020.9230990 ◽

2020 ◽

Author(s):

Md. Sazedur Rahman ◽

Md. Selim Hossain

Keyword(s):

Elliptic Curve ◽

Elliptic Curve Cryptography ◽

Efficient Implementation ◽

Modular Multiplication ◽

Area Efficient

Download Full-text

High-Performance Pipelined Architecture of Elliptic Curve Scalar Multiplication Over GF(2m)

IEEE Transactions on Very Large Scale Integration (VLSI) Systems ◽

10.1109/tvlsi.2015.2453360 ◽

2016 ◽

Vol 24 (4) ◽

pp. 1223-1232 ◽

Cited By ~ 20

Author(s):

Lijuan Li ◽

Shuguo Li

Keyword(s):

Elliptic Curve ◽

High Performance ◽

Scalar Multiplication ◽

Pipelined Architecture ◽

Elliptic Curve Scalar Multiplication

Download Full-text

Core-Level Modeling and Frequency Prediction for DSP Applications on FPGAs

International Journal of Reconfigurable Computing ◽

10.1155/2015/784672 ◽

2015 ◽

Vol 2015 ◽

pp. 1-20

Author(s):

Gongyu Wang ◽

Greg Stitt ◽

Herman Lam ◽

Alan George

Keyword(s):

High Performance ◽

Design Space Exploration ◽

Design Space ◽

Space Exploration ◽

Core Level ◽

Prediction Methods ◽

Clock Frequency ◽

Worst Case ◽

Model Based ◽

Dsp Applications

Field-programmable gate arrays (FPGAs) provide a promising technology that can improve performance of many high-performance computing and embedded applications. However, unlike software design tools, the relatively immature state of FPGA tools significantly limits productivity and consequently prevents widespread adoption of the technology. For example, the lengthy design-translate-execute (DTE) process often must be iterated to meet the application requirements. Previous works have enabled model-based, design-space exploration to reduce DTE iterations but are limited by a lack of accurate model-based prediction of key design parameters, the most important of which is clock frequency. In this paper, we present a core-level modeling and design (CMD) methodology that enables modeling of FPGA applications at an abstract level and yet produces accurate predictions of parameters such as clock frequency, resource utilization (i.e., area), and latency. We evaluate CMD’s prediction methods using several high-performance DSP applications on various families of FPGAs and show an average clock-frequency prediction error of 3.6%, with a worst-case error of 20.4%, compared to the best of existing high-level prediction methods, 13.9% average error with 48.2% worst-case error. We also demonstrate how such prediction enables accurate design-space exploration without coding in a hardware-description language (HDL), significantly reducing the total design time.

Download Full-text

HIPERMA: A high performance and reconfigurable processor for SAR applications

2007 1st Asian and Pacific Conference on Synthetic Aperture Radar ◽

10.1109/apsar.2007.4418599 ◽

2007 ◽

Author(s):

Shushan Qiao ◽

Yong Hei ◽

Xinfeng Xu ◽

Bin Wu ◽

Yumei Zhou

Keyword(s):

High Performance ◽

Reconfigurable Processor

Download Full-text