Three hardware architectures for the binary modular exponentiation: sequential, parallel, and systolic

AbstractMontgomery’s and Barrett’s modular multiplication algorithms are widely used in modular exponentiation algorithms, e.g. to compute RSA or ECC operations. While Montgomery’s multiplication algorithm has been studied extensively in the literature and many side-channel attacks have been detected, to our best knowledge no thorough analysis exists for Barrett’s multiplication algorithm. This article closes this gap. For both Montgomery’s and Barrett’s multiplication algorithm, differences of the execution times are caused by conditional integer subtractions, so-called extra reductions. Barrett’s multiplication algorithm allows even two extra reductions, and this feature increases the mathematical difficulties significantly. We formulate and analyse a two-dimensional Markov process, from which we deduce relevant stochastic properties of Barrett’s multiplication algorithm within modular exponentiation algorithms. This allows to transfer the timing attacks and local timing attacks (where a second side-channel attack exhibits the execution times of the particular modular squarings and multiplications) on Montgomery’s multiplication algorithm to attacks on Barrett’s algorithm. However, there are also differences. Barrett’s multiplication algorithm requires additional attack substeps, and the attack efficiency is much more sensitive to variations of the parameters. We treat timing attacks on RSA with CRT, on RSA without CRT, and on Diffie–Hellman, as well as local timing attacks against these algorithms in the presence of basis blinding. Experiments confirm our theoretical results.

Download Full-text

Embedded Deep Learning Prototyping Approach for Cyber-Physical Systems: Smart LIDAR Case Study

Journal of Sensor and Actuator Networks ◽

10.3390/jsan10010018 ◽

2021 ◽

Vol 10 (1) ◽

pp. 18

Author(s):

Quentin Cabanes ◽

Benaoumeur Senouci ◽

Amar Ramdane-Cherif

Keyword(s):

Neural Network ◽

Deep Learning ◽

Computation Time ◽

Physical World ◽

Cyber Physical Systems ◽

Physical Systems ◽

Research Technology ◽

Standard Design ◽

Hardware Architectures ◽

Voxel Grid

Cyber-Physical Systems (CPSs) are a mature research technology topic that deals with Artificial Intelligence (AI) and Embedded Systems (ES). They interact with the physical world via sensors/actuators to solve problems in several applications (robotics, transportation, health, etc.). These CPSs deal with data analysis, which need powerful algorithms combined with robust hardware architectures. On one hand, Deep Learning (DL) is proposed as the main solution algorithm. On the other hand, the standard design and prototyping methodologies for ES are not adapted to modern DL-based CPS. In this paper, we investigate AI design for CPS around embedded DL. The main contribution of this work is threefold: (1) We define an embedded DL methodology based on a Multi-CPU/FPGA platform. (2) We propose a new hardware design architecture of a Neural Network Processor (NNP) for DL algorithms. The computation time of a feed forward sequence is estimated to 23 ns for each parameter. (3) We validate the proposed methodology and the DL-based NNP using a smart LIDAR application use-case. The input of our NNP is a voxel grid hardware computed from 3D point cloud. Finally, the results show that our NNP is able to process Dense Neural Network (DNN) architecture without bias.

Download Full-text

A Benchmarking of the Effectiveness of Modular Exponentiation Algorithms using the library GMP in C language

2020 International Conference on Computational Intelligence (ICCI) ◽

10.1109/icci51257.2020.9247766 ◽

2020 ◽

Author(s):

Tran Quy Ban ◽

Tran Thi Thuy Nguyen ◽

Vu Thanh Long ◽

Pham Dang Dung ◽

Bui Thanh Tung

Keyword(s):

Modular Exponentiation ◽

C Language

Download Full-text

Simulation of Modular Exponentiation Circuit for Shor's Algorithm in Qiskit

2020 14th International Conference on Telecommunication Systems, Services, and Applications (TSSA ◽

10.1109/tssa51342.2020.9310794 ◽

2020 ◽

Author(s):

Harashta Tatimma Larasati ◽

Howon Kim

Keyword(s):

Modular Exponentiation ◽

Shor's Algorithm

Download Full-text

Study on Several Fast Algorithm of Modular Exponentiation in RSA

2011 International Conference on Network Computing and Information Security ◽

10.1109/ncis.2011.82 ◽

2011 ◽

Author(s):

Tong Zhou

Keyword(s):

Fast Algorithm ◽

Modular Exponentiation

Download Full-text

Hardware architectures for inversion in GF(2m) using polynomial and gaussian normal basis

2010 IEEE ANDESCON ◽

10.1109/andescon.2010.5633137 ◽

2010 ◽

Cited By ~ 2

Author(s):

Vladimir Trujillo-Olaya ◽

Jaime Velasco-Med

Keyword(s):

Normal Basis ◽

Gaussian Normal Basis ◽

Hardware Architectures

Download Full-text

Embryonic systems implementation with FPGA-based artificial cell network hardware architectures

Asian Journal of Control ◽

10.1002/asjc.166 ◽

2010 ◽

Vol 12 (2) ◽

pp. 208-215 ◽

Cited By ~ 6

Author(s):

Csaba Szász ◽

Virgil Chindriş ◽

Géza Husi

Keyword(s):

Cell Network ◽

Systems Implementation ◽

Artificial Cell ◽

Hardware Architectures

Download Full-text

Fast modular exponentiation and elliptic curve group operation in Maple

International Journal of Mathematical Education in Science and Technology ◽

10.1080/00207390600712422 ◽

2006 ◽

Vol 37 (6) ◽

pp. 745-753

Author(s):

S. Y. Yan ◽

G. James

Keyword(s):

Elliptic Curve ◽

Modular Exponentiation ◽

Group Operation

Download Full-text

Operating System for Runtime Reconfigurable Multiprocessor Systems

International Journal of Reconfigurable Computing ◽

10.1155/2011/121353 ◽

2011 ◽

Vol 2011 ◽

pp. 1-16 ◽

Cited By ~ 16

Author(s):

Diana Göhringer ◽

Michael Hübner ◽

Etienne Nguepi Zeutebouo ◽

Jürgen Becker

Keyword(s):

Operating System ◽

Resource Management ◽

Multiprocessor System ◽

Task Mapping ◽

Access Port ◽

Novel Approach ◽

Hardware Resource ◽

Hardware Architectures ◽

On Chip ◽

Internal Configuration

Operating systems traditionally handle the task scheduling of one or more application instances on processor-like hardware architectures. RAMPSoC, a novel runtime adaptive multiprocessor System-on-Chip, exploits the dynamic reconfiguration on FPGAs to generate, start and terminate hardware and software tasks. The hardware tasks have to be transferred to the reconfigurable hardware via a configuration access port. The software tasks can be loaded into the local memory of the respective IP core either via the configuration access port or via the on-chip communication infrastructure (e.g. a Network-on-Chip). Recent-series of Xilinx FPGAs, such as Virtex-5, provide two Internal Configuration Access Ports, which cannot be accessed simultaneously. To prevent conflicts, the access to these ports as well as the hardware resource management needs to be controlled, e.g. by a special-purpose operating system running on an embedded processor. For that purpose and to handle the relations between temporally and spatially scheduled operations, the novel approach of an operating system is of high importance. This special purpose operating system, called CAP-OS (Configuration Access Port-Operating System), which will be presented in this paper, supports the clients using the configuration port with the services of priority-based access scheduling, hardware task mapping and resource management.

Download Full-text