An FPGA implementation of high speed and area efficient double-precision floating point multiplier using Urdhva Tiryagbhyam technique

:Floating point arithmetic plays a major role in scientific and embedded computing applications. But the performance of field programmable gate arrays (FPGAs) used for floating point applications is poor due to the complexity of floating point arithmetic. The implementation of floating point units on FPGAs consumes a large amount of resources and that leads to the development of embedded floating point units in FPGAs. Embedded applications like multimedia, communication and DSP algorithms use floating point arithmetic in processing graphics, Fourier transformation, coding, etc. In this paper, methodologies are presented for the implementation of embedded floating point units on FPGA. The work is focused with the aim of achieving high speed of computations and to reduce the power for evaluating expressions. An application that demands high performance floating point computation can achieve better speed and density by incorporating embedded floating point units. Additionally this paper describes a comparative study of the design of single precision and double precision pipelined floating point arithmetic units for evaluating expressions. The modules are designed using VHDL simulation in Xilinx software and implemented on VIRTEX and SPARTAN FPGAs.

Download Full-text

High-speed, area-efficient FPGA-based floating-point multiplier

Proceedings of the 12th IEEE International Conference on Fuzzy Systems (Cat. No.03CH37442) ◽

10.1109/icm.2003.237828 ◽

2003 ◽

Cited By ~ 1

Author(s):

G.H.A. Aty ◽

A.I. Hussein ◽

I.S. Ashour ◽

M. Mones

Keyword(s):

High Speed ◽

Floating Point ◽

Area Efficient

Download Full-text

Memory Compact High-Speed QC-LDPC Decoder Based on FPGA

Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University ◽

10.1051/jnwpu/20193730515 ◽

2019 ◽

Vol 37 (3) ◽

pp. 515-522

Author(s):

Tianjiao Xie ◽

Bo Li ◽

Mao Yang ◽

Zhongjiang Yan

Keyword(s):

High Speed ◽

Clock Cycle ◽

Fpga Implementation ◽

Maximum Frequency ◽

Ldpc Decoder ◽

Memory Strategies ◽

Multiple Data ◽

Decoder Architecture ◽

Place And Route ◽

Area Efficient

In this paper, two compact memory strategies for partially parallel QC-LDPC decoder architecture are proposed. By compacting several adjacent rows hard decisions and extrinsic messages into one memory entry, which not only reduces the number of memory banks for hard decisions, but also facilitates multiple data accesses per clock cycle so as to increase the throughput of decoder. We demonstrate significant high speed and area efficient benefits of using the proposed techniques with an FPGA implementation of a CCSDS LDPC decoder on Xilinx XC5VLX330 device. The result shows that our new decoder can operate at a maximum frequency of 250 MHz after place and route, and achieve a throughput up to 2 Gb/s at 14 iterations.

Download Full-text

FPC: A High-Speed Compressor for Double-Precision Floating-Point Data

IEEE Transactions on Computers ◽

10.1109/tc.2008.131 ◽

2009 ◽

Vol 58 (1) ◽

pp. 18-31 ◽

Cited By ~ 107

Author(s):

Martin Burtscher ◽

Paruj Ratanaworabhan

Keyword(s):

High Speed ◽

Floating Point ◽

Double Precision ◽

Point Data

Download Full-text

Area-Efficient Architecture for Dual-Mode Double Precision Floating Point Division

IEEE Transactions on Circuits and Systems I Regular Papers ◽

10.1109/tcsi.2016.2607227 ◽

2017 ◽

Vol 64 (2) ◽

pp. 386-398 ◽

Cited By ~ 4

Author(s):

Manish Kumar Jaiswal ◽

Hayden K.-H. So

Keyword(s):

Floating Point ◽

Double Precision ◽

Dual Mode ◽

Area Efficient

Download Full-text

A pipelined area-efficient and high-speed reconfigurable processor for floating-point FFT/IFFT and DCT/IDCT computations

Microelectronics Journal ◽

10.1016/j.mejo.2015.11.004 ◽

2016 ◽

Vol 47 ◽

pp. 19-30 ◽

Cited By ~ 3

Author(s):

Mingyu Wang ◽

Fang Wang ◽

Shaojun Wei ◽

Zhaolin Li

Keyword(s):

High Speed ◽

Floating Point ◽

Reconfigurable Processor ◽

Area Efficient

Download Full-text

A design of high speed double precision floating point adder using macro modules

Proceedings of the ASP-DAC 2005. Asia and South Pacific Design Automation Conference, 2005. ◽

10.1109/aspdac.2005.1466603 ◽

2005 ◽

Author(s):

Chi Huang ◽

Xinyu Wu ◽

Jinmei Lai ◽

Chengshou Sun ◽

Gang Li

Keyword(s):

High Speed ◽

Floating Point ◽

Double Precision

Download Full-text

High speed and area efficient single precision floating point arithmetic unit

2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT) ◽

10.1109/rteict.2016.7808177 ◽

2016 ◽

Cited By ~ 2

Author(s):

Sangeeta Palekar ◽

Nitin Narkhede

Keyword(s):

High Speed ◽

Floating Point ◽

Arithmetic Unit ◽

Single Precision ◽

Floating Point Arithmetic ◽

Point Arithmetic ◽

Area Efficient

Download Full-text

Design and Implementation of FPU for Optimised Speed

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.c6444.029320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 3922-3933

Keyword(s):

Energy Efficient ◽

High Speed ◽

Software Tool ◽

Digital Signal ◽

Floating Point ◽

Double Precision ◽

Arithmetic Unit ◽

Single Precision ◽

Point Multiplication ◽

Floating Point Unit

Currently, each CPU has one or additional Floating Point Units (FPUs) integrated inside it. It is usually utilized in math wide-ranging applications, such as digital signal processing. It is found in places be established in engineering, medical and military fields in adding along to in different fields requiring audio, image or video handling. A high-speed and energy-efficient floating point unit is naturally needed in the electronics diligence as an arithmetic unit in microprocessors. The most operations accounting 95% of conformist FPU are multiplication and addition. Many applications need the speedy execution of arithmetic operations. In the existing system, the FPM(Floating Point Multiplication) and FPA(Floating Point Addition) have more delay and fewer speed and fewer throughput. The demand for high speed and throughput intended to design the multiplier and adder blocks within the FPM (Floating point multiplication)and FPA(Floating Point Addition) in a format of single precision floating point and double-precision floating point operation is internally pipelined to achieve high throughput and these are supported by the IEEE 754 standard floating point representations. This is designed with the Verilog code using Xilinx ISE 14.5 software tool is employed to code and verify the ensuing waveforms of the designed code

Download Full-text