A high-throughput scalable BNN accelerator with fully pipelined architecture

Advanced Encryption Standard (AES) is the most popular symmetric encryption method, which encrypts streams of data by using symmetric keys. The current preferable AES architectures employ effective methods to achieve two important goals: protection against power analysis attacks and high-throughput. Based on a different architectural point of view, we implement a particular parallel architecture for the latter goal, which is capable of implementing a more efficient pipelining in field-programmable gate array (FPGA). In this regard, all intermediate registers which have a role for unrolling the main loop will be removed. Also, instead of unrolling the main loop of AES algorithm, we implement pipelining structure by replicating nonpipelined AES architectures and using an auto-assigner mechanism for each AES block. By implementing the new pipelined architecture, we achieve two valuable advantages: (a) solving single point of failure problem when one of the replicated parts is faulty and (b) deploying the proposed design as a fault tolerant AES architecture. In addition, we put emphasis on area optimization for all four AES main functions to reduce the overhead associated with AES block replication. The simulation results show that the maximum frequency of our proposed AES architecture is 675.62[Formula: see text]MHz, and for AES128 the throughput is 86.5[Formula: see text]Gbps which is 30.9% better than its closest existing competitor.

Download Full-text

High Throughput, low cost, Fully Pipelined Architecture for AES Crypto Chip

2006 Annual IEEE India Conference ◽

10.1109/indcon.2006.302814 ◽

2006 ◽

Cited By ~ 15

Author(s):

Nalini Iyer ◽

P.V. Anandmohan ◽

D.V Poornaiah ◽

V.D. Kulkarni

Keyword(s):

High Throughput ◽

Low Cost ◽

Pipelined Architecture

Download Full-text

A Parallel and Pipelined Architecture for Accelerating Fingerprint Computation in High Throughput Data Storages

2015 IEEE 23rd Annual International Symposium on Field-Programmable Custom Computing Machines ◽

10.1109/fccm.2015.43 ◽

2015 ◽

Cited By ~ 3

Author(s):

Dongyang Li ◽

Qing Yang ◽

Qingbo Wang ◽

Cyril Guyot ◽

Ashwin Narasimha ◽

...

Keyword(s):

High Throughput ◽

High Throughput Data ◽

Pipelined Architecture

Download Full-text

Towards an Optimized Architecture for Unified Binary Huff Curves

Journal of Circuits System and Computers ◽

10.1142/s021812661750178x ◽

2017 ◽

Vol 26 (11) ◽

pp. 1750178 ◽

Cited By ~ 11

Author(s):

Atif Raza Jafri ◽

Muhammad Najam ul Islam ◽

Malik Imran ◽

Muhammad Rashid

Keyword(s):

High Throughput ◽

Elliptic Curve Cryptography ◽

Power Analysis ◽

State Of The Art ◽

Area Ratio ◽

Optimal Scheduling ◽

Side Channel ◽

Pipelined Architecture ◽

Work Up ◽

Logic Unit

Applying unified formula while computing point addition and doubling provides immunity to Elliptic Curve Cryptography (ECC) against power analysis attacks (a type of side channel attack). One of the popular techniques providing this unifiedness is the Binary Huff Curves (BHC) which got attention in 2011. In this paper we are presenting highly optimized architectures to implement point multiplication (PM) on the standard NIST curves over [Formula: see text] and [Formula: see text] using BHC. To achieve a high throughput over area ratio, first of all, we have used a simplified arithmetic and logic unit. Secondly, we have reduced the time to compute PM through Double and Add algorithm. This is achieved by increasing the frequency of operation through a 2-stage pipelined architecture. The increase in clock cycles caused by consequent pipeline hazards is controlled through optimal scheduling of computations involved in PM. The synthesis results show that our designs can work up to a frequency of 377[Formula: see text]MHz on Xilinx Virtex 7 FPGA. Moreover, the overall throughput/area ratio achieved through the adopted approach is up to 20% higher while comparing with available state-of-the-art solutions.

Download Full-text

PSP: Parallel sub-pipelined architecture for high throughput AES on FPGA and ASIC

Open Computer Science ◽

10.2478/s13537-013-0112-2 ◽

2013 ◽

Vol 3 (4) ◽

Cited By ~ 9

Author(s):

K. Rahimunnisa ◽

P. Karthigaikumar ◽

N. Christy ◽

S. Kumar ◽

J. Jayakumar

Keyword(s):

High Throughput ◽

Integrated Circuit ◽

Security Applications ◽

Aes Algorithm ◽

Pipelined Architecture ◽

Field Programmable ◽

Application Specific Integrated Circuit ◽

Parallel Pipelined ◽

Application Specific ◽

Day By Day

AbstractAs the technology is growing day by day, information security plays a very important role in our lives. In order to protect the information, several cryptographic algorithms have been proposed. The aim of this paper is to present an effective Advanced Encryption Standard (AES) architecture to achieve high throughput for security applications. The Parallel Sub-Pipelined architecture (PSP) is proposed in order to obtain high throughput. The proposed architecture is also compared with loop unrolled, pipelined, sub-pipelined, parallel and parallel pipelined architecture in terms of throughput. The AES algorithm using Parallel Sub-Pipelined architecture was prototyped in FPGA (Field Programmable Gate Array) and ASIC (Application Specific Integrated Circuit).The proposed architecture yielded a throughput of 59.59 Gbps at a frequency of 450.045 MHz on FPGA Virtex XC6VLX75T which is higher than the throughput yielded in other architectures. In ASIC 0.13 µm technology, the proposed architecture yielded a throughput of 25.60 Gbps and in 0.18 µm, it yielded a throughput of 20.56 Gbps.

Download Full-text

A high-throughput pipelined architecture for JPEG XR encoding

2009 IEEE/ACM/IFIP 7th Workshop on Embedded Systems for Real-Time Multimedia ◽

10.1109/estmed.2009.5336818 ◽

2009 ◽

Cited By ~ 6

Author(s):

Koichi Hattori ◽

Hiroshi Tsutsui ◽

Hiroyuki Ochi ◽

Yukihiro Nakamura

Keyword(s):

High Throughput ◽

Pipelined Architecture ◽

Jpeg Xr

Download Full-text

A METHOD FOR PERFORMANCE MODELING AND EVALUATION OF LDPC DECODER ARCHITECTURE

International Journal of Modeling Simulation and Scientific Computing ◽

10.1142/s1793962313500037 ◽

2013 ◽

Vol 04 (02) ◽

pp. 1350003

Author(s):

TONY TSANG

Keyword(s):

High Throughput ◽

Evaluation Process ◽

Ldpc Code ◽

Simulation Method ◽

High Rate ◽

Area Network ◽

Ldpc Decoder ◽

Pipelined Architecture ◽

Performance Modeling And Evaluation ◽

Parameter Values

This paper presents a high-throughput memory efficient decoder for low density parity check (LDPC) codes in the high-rate wireless personal area network application. The novel techniques which can apply to our selected LDPC code is proposed, including parallel blocked layered decoding architecture and simplification of the WiGig networks. State-of-the-art flexible LDPC decoders cannot simultaneously achieve the high throughput mandated by these standards and the low power needed for mobile applications. This work develops a flexible, fully pipelined architecture for the IEEE 802.11ad standard capable of achieving both goals. We use Real Time–Performance Evaluation Process Algebra (RT-PEPA) to evaluate a typical LDPC Decoder system's performance. The approach is more convenient, flexible, and lower cost than the former simulation method which needs to develop special hardware and software tools. Moreover, we can easily analyze how changes in performance depend on changes in a particular mode by supplying ranges for parameter values.

Download Full-text

Low cost high throughput pipelined architecture of 2-D 8 × 8 integer transforms for H.264/AVC

International Journal of Electronics ◽

10.1080/00207217.2012.731371 ◽

2013 ◽

Vol 100 (8) ◽

pp. 1033-1045 ◽

Cited By ~ 2

Author(s):

Meeturani Sharma ◽

Honey Durga Tiwari ◽

Yong Beom Cho

Keyword(s):

High Throughput ◽

Low Cost ◽

Pipelined Architecture

Download Full-text

A high-throughput scalable BNN accelerator with fully pipelined architecture

A high-throughput pipelined architecture for blind adaptive equalizer with minimum latency

High throughput pipelined architecture for fast 2-D 4×4 forward integer transform of H.264

A Fault Tolerant Parallelism Approach for Implementing High-Throughput Pipelined Advanced Encryption Standard

High Throughput, low cost, Fully Pipelined Architecture for AES Crypto Chip

A Parallel and Pipelined Architecture for Accelerating Fingerprint Computation in High Throughput Data Storages

Towards an Optimized Architecture for Unified Binary Huff Curves

PSP: Parallel sub-pipelined architecture for high throughput AES on FPGA and ASIC

A high-throughput pipelined architecture for JPEG XR encoding

A METHOD FOR PERFORMANCE MODELING AND EVALUATION OF LDPC DECODER ARCHITECTURE

Low cost high throughput pipelined architecture of 2-D 8 × 8 integer transforms for H.264/AVC

Export Citation Format