A High-Speed Radix-64 Parallel Multiplier Using a Novel Hardware Implementation Approach for Partial Product Generation Based on Redundant Binary Arithmetic

This paper presents the details of hardware implementation of modified partial product reduction tree using 4:2 and 5:2 compressors. Speed of multiplication operation is improved by using higher compressors .In order to improve the speed of the multiplication process within the computational unit; there is a major bottleneck that is needed to be considered that is the partial products reduction network which is used in the multiplication block. For implementation of this stage require addition of large operands that involve long paths for carry propagation. The proposed architecture is based on binary tree constructed using modified 4:2 and 5:2 compressor circuits. Increasing the speed of operation is achieved by using higher modified compressors in critical path. Our objective of work is, to increase the speed of multiplication operation by minimizing the number of combinational gates using higher n:2 compressors. The experimental test of the proposed modified compressor is done using Spartan-3FPGA device (XC3S400 PQ-208). Using tree architectures for the partial products reduction network represent an attractive solution that is frequently applied to speed up the multiplication process. The simulation result shows 4:2 and 5:2 compressor output which is done using Questa Sim 6.4c Mentor Graphics tool.

Download Full-text

A Radix-4 Partial Product Generation-Based Approximate Multiplier for High-speed and Low-power Digital Signal Processing

2018 25th IEEE International Conference on Electronics, Circuits and Systems (ICECS) ◽

10.1109/icecs.2018.8617854 ◽

2018 ◽

Author(s):

Xiaoting Sun ◽

Yi Guo ◽

Zhenhao Liu ◽

Shinji Kimura

Keyword(s):

Signal Processing ◽

Digital Signal Processing ◽

Low Power ◽

High Speed ◽

Digital Signal ◽

Partial Product ◽

Product Generation

Download Full-text

Design and analysis of a compact fast parallel multiplier for high speed DSP applications using novel partial product generator and 4 : 2 compressor

International Journal of Electronics ◽

10.1080/00207210801915675 ◽

2008 ◽

Vol 95 (2) ◽

pp. 139-157 ◽

Cited By ~ 2

Author(s):

Subhendu Kumar Sahoo ◽

Chandra Shekhar

Keyword(s):

High Speed ◽

Partial Product ◽

Parallel Multiplier ◽

Dsp Applications

Download Full-text

A Regularly Structured Parallel Multiplier with Low-power Non-binary-logic Counter Circuits

VLSI Design ◽

10.1155/2001/97598 ◽

2001 ◽

Vol 12 (3) ◽

pp. 377-390 ◽

Cited By ~ 5

Author(s):

Rong Lin

Keyword(s):

Low Power ◽

High Performance ◽

Partial Product ◽

Recursive Decomposition ◽

Binary Logic ◽

Parallel Multiplier ◽

Low Leakage ◽

Binary Arithmetic ◽

Distinct Features ◽

Cmos Implementation

A highly regular parallel multiplier architecture along with the novel low-power, high-performance CMOS implementation circuits is presented. The superiority is achieved through utilizing a unique scheme for recursive decomposition of partial product matrices and a recently proposed non-binary arithmetic logic as well as the complementary shift switch logic circuits.The proposed 64×64-b parallel multiplier possesses the following distinct features: (1) generating 64 8×8-b partial product matrices instead of a single large one; (2) comprising only four stages of bit reductions: first, by 8×8-b small parallel multipliers, then, by small parallel counters in each of the remaining three stages. A family of shift switch parallel counters, including non-binary (6, 3)∗ and complementary (k, 2) for 2 ≤ k ≤ 8, are proposed for the efficient bit reductions; (3) using a simple final adder.The non-binary logic operates 4-bit state signals (representing integers ranging from (0 to 3), where no more than half of the signal bits are subject to value-change at any logic stage. This and others including minimum transistor counts, fewer inverters, and low-leakage logic structure, significantly reduce circuit power dissipation.

Download Full-text

HARDWARE IMPLEMENTATION AND VERIFICATION OF FIR FILTER UTILIZING M-BIT PDA

Journal of Circuits System and Computers ◽

10.1142/s0218126610006207 ◽

2010 ◽

Vol 19 (02) ◽

pp. 503-517

Author(s):

SHIANN-SHIUN JENG ◽

HSING-CHEN LIN ◽

CHUN-CHYUAN CHEN ◽

SHU-MING CHANG

Keyword(s):

Hardware Implementation ◽

Fir Filter ◽

Partial Product ◽

Distributed Arithmetic ◽

Clock Rate ◽

Parallel Multiplier

An efficient architecture for a FPGA symmetry FIR filter is proposed that employs the M-bit parallel-distributed arithmetic (M-bit PDA). The partial product is pre-calculated and saved into the distributed ROM to eliminate a large amount of multiplications. Altera Stratix II EP2S60 is used as a target device to implement the M-bit PDA. The hardware implementation requires 936 adaptive look-up tables (ALUTs), 888 registers, 1 PLL, 40960 memory bits for the FIR filter implementation with the M-bit PDA (in this case M = 2). Additionally, the maximum clock rate for this implementation can be achieved up to 155.36 MHz. In comparison with the parallel multiplier/adder cell (MAC) and serial distributed arithmetic (SDA), the proposed architecture consumes a smaller area and operates with a higher speed due to omitting the multipliers.

Download Full-text

VLSI ARCHITECTURE OF PARALLEL MULTIPLIER– ACCUMULATOR BASED ON RADIX-2 MODIFIED BOOTH ALGORITHM

International Journal of Electronics and Electical Engineering ◽

10.47893/ijeee.2012.1009 ◽

2012 ◽

pp. 40-46

Author(s):

Mr.M.V. Sathish ◽

Mrs. Sailaja

Keyword(s):

Signal Processing ◽

High Speed ◽

High Performance ◽

Vlsi Architecture ◽

Clock Frequency ◽

Parallel Multiplier ◽

Hybrid Type ◽

Standard Design ◽

Overall Performance ◽

And Performance

A new architecture of multiplier-andaccumulator (MAC) for high-speed arithmetic. By combining multiplication with accumulation and devising a hybrid type of carry save adder (CSA), the performance was improved. Since the accumulator that has the largest delay in MAC was merged into CSA, the overall performance was elevated. The proposing method CSA tree uses 1’s-complement-based radix-2 modified Booth’s algorithm (MBA) and has the modified array for the sign extension in order to increase the bit density of the operands. The proposed MAC showed the superior properties to the standard design in many ways and performance twice as much as the previous research in the similar clock frequency. We expect that the proposed MAC can be adapted to various fields requiring high performance such as the signal processing areas.

Download Full-text

Compact and High Speed Hardware Implementation of the Block- Cipher Clefia

International Journal of Computer Applications ◽

10.5120/ijca2016907937 ◽

2016 ◽

Vol 133 (8) ◽

pp. 17-20

Author(s):

V.A. Suryawanshi ◽

G.C. Manna ◽

S.S. Dorale

Keyword(s):

High Speed ◽

Hardware Implementation ◽

Block Cipher

Download Full-text

Implementation of high speed radix-10 parallel multiplier using Verilog

2015 19th International Symposium on VLSI Design and Test ◽

10.1109/isvdat.2015.7208073 ◽

2015 ◽

Author(s):

Sonam Negi ◽

Pitchaiah Madduri

Keyword(s):

High Speed ◽

Parallel Multiplier

Download Full-text

Hardware Implementation of Floating-point Operating Devices by Using IEEE-754 Binary Arithmetic Standard

2019 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus) ◽

10.1109/eiconrus.2019.8656775 ◽

2019 ◽

Cited By ~ 1

Author(s):

Aung Myo San ◽

A.N. Yakunin

Keyword(s):

Hardware Implementation ◽

Floating Point ◽

Binary Arithmetic

Download Full-text

A High-Speed Radix-64 Parallel Multiplier Using a Novel Hardware Implementation Approach for Partial Product Generation Based on Redundant Binary Arithmetic

Ultra-high speed parallel multiplier with new first partial product addition algorithm

Design and Implementation of Modified Partial Product Reduction Tree for High Speed Multiplication

A Radix-4 Partial Product Generation-Based Approximate Multiplier for High-speed and Low-power Digital Signal Processing

Design and analysis of a compact fast parallel multiplier for high speed DSP applications using novel partial product generator and 4 : 2 compressor

A Regularly Structured Parallel Multiplier with Low-power Non-binary-logic Counter Circuits

HARDWARE IMPLEMENTATION AND VERIFICATION OF FIR FILTER UTILIZING M-BIT PDA

VLSI ARCHITECTURE OF PARALLEL MULTIPLIER– ACCUMULATOR BASED ON RADIX-2 MODIFIED BOOTH ALGORITHM

Compact and High Speed Hardware Implementation of the Block- Cipher Clefia

Implementation of high speed radix-10 parallel multiplier using Verilog

Hardware Implementation of Floating-point Operating Devices by Using IEEE-754 Binary Arithmetic Standard

Export Citation Format