Design and Implementation of Area Efficient Single Precision Floating Point Unit

The use of reconfigurable computing for accelerating floating-point intensive codes is becoming common due to the availability of DSPs in new-generation FPGAs. We present the design of an efficient, pipelined floating-point datapath for calculating the logarithm function on reconfigurable devices. We integrate the datapath into a stand-alone LUT-based (Lookup Table) component, the LAU (Logarithm Approximation Unit). We extended the LAU, by integrating two architecturally independent, LAU-based datapaths into a larger component, the VLAU (vector-like LAU). The VLAU produces 2 results/cycle, while occupying the same amount of memory as the LAU. Under single precision, one LAU is 12 and 1.7 times faster than the GNU and Intel Math Kernel Library (MKL) implementations, respectively. The LAU is also 1.6 times faster than the FloPoCo reconfigurable logarithm architecture. Under double precision, one LAU is 20 and 2.6 times faster than the respective GNU and MKL functions and 1.4 times faster than the FloPoCo logarithm. The VLAU is approximately twice as fast as the LAU, both under single and double precision.

Download Full-text

DLX gold: design and implementation of a DLX microprocessor with single precision floating- point operations

TENCON 2007 - 2007 IEEE Region 10 Conference ◽

10.1109/tencon.2007.4429093 ◽

2007 ◽

Cited By ~ 1

Author(s):

John Edrian H. Aguilar ◽

Rosario M. Reas ◽

John Benedict B. Villangca ◽

Anastacia P. Ballesil ◽

Joy Alinda P. Reyes

Keyword(s):

Floating Point ◽

Single Precision ◽

Design And Implementation

Download Full-text

High speed and area efficient single precision floating point arithmetic unit

2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT) ◽

10.1109/rteict.2016.7808177 ◽

2016 ◽

Cited By ~ 2

Author(s):

Sangeeta Palekar ◽

Nitin Narkhede

Keyword(s):

High Speed ◽

Floating Point ◽

Arithmetic Unit ◽

Single Precision ◽

Floating Point Arithmetic ◽

Point Arithmetic ◽

Area Efficient

Download Full-text

An area-efficient standard-cell floating-point unit design for a processing-in-memory system

ESSCIRC 2004 - 29th European Solid-State Circuits Conference (IEEE Cat. No.03EX705) ◽

10.1109/esscirc.2003.1257071 ◽

2004 ◽

Cited By ~ 2

Author(s):

Joong-Seok Moon ◽

Taek-Jun Kwon ◽

J. Sondeen ◽

J. Draper

Keyword(s):

Memory System ◽

Floating Point ◽

Standard Cell ◽

Unit Design ◽

Floating Point Unit ◽

Area Efficient

Download Full-text

Design and Implementation of FPU for Optimised Speed

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.c6444.029320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 3922-3933

Keyword(s):

Energy Efficient ◽

High Speed ◽

Software Tool ◽

Digital Signal ◽

Floating Point ◽

Double Precision ◽

Arithmetic Unit ◽

Single Precision ◽

Point Multiplication ◽

Floating Point Unit

Currently, each CPU has one or additional Floating Point Units (FPUs) integrated inside it. It is usually utilized in math wide-ranging applications, such as digital signal processing. It is found in places be established in engineering, medical and military fields in adding along to in different fields requiring audio, image or video handling. A high-speed and energy-efficient floating point unit is naturally needed in the electronics diligence as an arithmetic unit in microprocessors. The most operations accounting 95% of conformist FPU are multiplication and addition. Many applications need the speedy execution of arithmetic operations. In the existing system, the FPM(Floating Point Multiplication) and FPA(Floating Point Addition) have more delay and fewer speed and fewer throughput. The demand for high speed and throughput intended to design the multiplier and adder blocks within the FPM (Floating point multiplication)and FPA(Floating Point Addition) in a format of single precision floating point and double-precision floating point operation is internally pipelined to achieve high throughput and these are supported by the IEEE 754 standard floating point representations. This is designed with the Verilog code using Xilinx ISE 14.5 software tool is employed to code and verify the ensuing waveforms of the designed code

Download Full-text