floating point unit Latest Research Papers

The reduction in energy consumption is key for deep neural networks (DNNs) to ensure usability and reliability, whether they are deployed on low-power end-nodes with limited resources or high-performance platforms that serve large pools of users. Leveraging the over-parametrization shown by many DNN models, convolutional neural networks (ConvNets) in particular, energy efficiency can be improved substantially preserving the model accuracy. The solution proposed in this work exploits the intrinsic redundancy of ConvNets to maximize the reuse of partial arithmetic results during the inference stages. Specifically, the weight-set of a given ConvNet is discretized through a clustering procedure such that the largest possible number of inner multiplications fall into predefined bins; this allows an off-line computation of the most frequent results, which in turn can be stored locally and retrieved when needed during the forward pass. Such a reuse mechanism leads to remarkable energy savings with the aid of a custom processing element (PE) that integrates an associative memory with a standard floating-point unit (FPU). Moreover, the adoption of an approximate associative rule based on a partial bit-match increases the hit rate over the pre-computed results, maximizing the energy reduction even further. Results collected on a set of ConvNets trained for computer vision and speech processing tasks reveal that the proposed associative-based hw-sw co-design achieves up to 77% in energy savings with less than 1% in accuracy loss.

Download Full-text

Design and Assertion Based Verification of RISC-V Processor Subsystems

Journal of VLSI Design and Signal Processing ◽

10.46610/jovdsp.2021.v07i03.002 ◽

2021 ◽

Vol 7 (3) ◽

Author(s):

Shruthi . ◽

Jamuna S

Keyword(s):

High Speed ◽

Branch Prediction ◽

Floating Point ◽

Processor Architecture ◽

Pipeline Architecture ◽

Description Language ◽

Instruction Fetch ◽

Hardware Description ◽

Floating Point Unit ◽

Back Stage

RISC-V is an open, free standard architecture. As its open-source architecture, it can be used in multiple applications like embedded processors, IoT, artificial intelligence, machine learning, military and defense applications. The parameters like throughput, performance, high speed etc., become essential in designing processor architecture. Pipelining is one such unique feature supported by RISC-V ISA, which basically involves the execution of multiple instructions in single cycle. This feature helps in improving the performance of the processor architecture. RISC-V ISA supports five stages of pipelining they are instruction fetch, instruction decode, execute, memory and write-back stage. The work covered in this paper involves the design and implementation of the subsystems of the RISC-V ISA which are present in different stages of pipeline architecture. The subsystems included in this work are Floating Point Unit (FPU) of Execute stage, Branch Prediction Unit (BPU) of instruction fetch stage, Forwarding Unit of execution stage, Operand Logic of decode stage and Floating-Point register file of Write-back stage. These subsystems are designed using Verilog Hardware Description Language in Xilinx ISE. Followed by the implementation the verification of the floating-point unit and the forwarding unit is performed using System Verilog Assertions in QuestaSim. The Assertion coverage report for the same is extracted.

Download Full-text

DTA-PUF: Dynamic Timing-aware Physical Unclonable Function for Resource-constrained Devices

ACM Journal on Emerging Technologies in Computing Systems ◽

10.1145/3434281 ◽

2021 ◽

Vol 17 (3) ◽

pp. 1-24

Author(s):

Ioannis Tsiokanos ◽

Jack Miskelly ◽

Chongyan Gu ◽

Maire O’neill ◽

Georgios Karakonstantis

Keyword(s):

Response Mechanism ◽

Resource Constrained ◽

Process Technology ◽

Physical Unclonable Functions ◽

Timing Errors ◽

Floating Point Unit ◽

Processing Effort ◽

Timing Behaviour ◽

Resource Constrained Devices ◽

Constrained Devices

In recent years, physical unclonable functions (PUFs) have gained a lot of attention as mechanisms for hardware-rooted device authentication. While the majority of the previously proposed PUFs derive entropy using dedicated circuitry, software PUFs achieve this from existing circuitry in a system. Such software-derived designs are highly desirable for low-power embedded systems as they require no hardware overhead. However, these software PUFs induce considerable processing overheads that hinder their adoption in resource-constrained devices. In this article, we propose DTA-PUF, a novel, software PUF design that exploits the instruction- and data-dependent dynamic timing behaviour of pipelined cores to provide a reliable challenge-response mechanism without requiring any extra hardware. DTA-PUF accepts sequences of instructions as an input challenge and produces an output response based on the manifested timing errors under specific over-clocked settings. To lower the required processing effort, we systematically select instruction sequences that maximise error-rate. The application to a post-layout pipelined floating-point unit, which is implemented in 45 nm process technology, demonstrates the effectiveness and practicability of our PUF design. Finally, DTA-PUF requires up to 50× fewer instructions than existing software processor PUF designs, limiting processing costs and resulting in up to 26% power savings.

Download Full-text

Design and Implementation of a 32-bit Floating Point Unit

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.35052 ◽

2021 ◽

Vol 9 (VI) ◽

pp. 731-736

Author(s):

Kishan Maladkar

Keyword(s):

Signal Processing ◽

Digital Signal Processing ◽

Digital Signal ◽

Floating Point ◽

Verilog Hdl ◽

Vedic Multiplier ◽

Common Operation ◽

Floating Point Unit ◽

Dsp Processors ◽

Mathematical Operations

A Floating Point Unit is a math co-processor that is in the most demand of Digital Signal Processing (DSP), Processors and more. It is used to perform functions or operations on floating point numbers like addition, subtraction, multiplication, division, square root and more. It is specifically designed to carry out mathematical operations and it can be emulated in CPU. Floating point unit is a common operation used in advanced Digital Signal Processing and various processor applications. The aim was to develop an optimized floating point unit so that the delay was reduced and efficiency was increased. The floating point unit has been written according to IEEE 754 standard and the entire design has been coded in Verilog HDL. The results are improved by 12% with the usage of Vedic multiplier that is a delay of 4.450ns as compared to 5.123ns with an array multiplier. Designs can be further optimized using low power designing techniques at architectural level. Different behaviour can be observed for different size and technologies.

Download Full-text

PERI

ACM Transactions on Architecture and Code Optimization ◽

10.1145/3446210 ◽

2021 ◽

Vol 18 (3) ◽

pp. 1-26

Author(s):

Sugandha Tiwari ◽

Neel Gala ◽

Chester Rebeiro ◽

V. Kamakoti

Keyword(s):

Computer Architecture ◽

Floating Point ◽

Instruction Set ◽

Single Precision ◽

C Programs ◽

Commercial Grade ◽

The Past ◽

Multiple Precision ◽

Execution Unit ◽

Floating Point Unit

Owing to the failure of Dennard’s scaling, the past decade has seen a steep growth of prominent new paradigms leveraging opportunities in computer architecture. Two technologies of interest are Posit and RISC-V. Posit was introduced in mid-2017 as a viable alternative to IEEE-754, and RISC-V provides a commercial-grade open source Instruction Set Architecture (ISA). In this article, we bring these two technologies together and propose a Configurable Posit Enabled RISC-V Core called PERI. The article provides insights on how the Single-Precision Floating Point (“F”) extension of RISC-V can be leveraged to support posit arithmetic. We also present the implementation details of a parameterized and feature-complete posit Floating Point Unit (FPU). The configurability and the parameterization features of this unit generate optimal hardware, which caters to the accuracy and energy/area tradeoffs imposed by the applications, a feature not possible with IEEE-754 implementation. The posit FPU has been integrated with the RISC-V compliant SHAKTI C-class core as an execution unit. To further leverage the potential of posit , we enhance our posit FPU to support two different exponent sizes (with posit-size being 32-bits), thereby enabling multiple-precision at runtime. To enable the compilation and execution of C programs on PERI, we have made minimal modifications to the GNU C Compiler (GCC), targeting the “F” extension of the RISC-V. We compare posit with IEEE-754 in terms of hardware area, application accuracy, and runtime. We also present an alternate methodology of integrating the posit FPU with the RISC-V core as an accelerator using the custom opcode space of RISC-V.

Download Full-text

The Floating-Point Unit (FPU) in the Cortex-M33 processor

Definitive Guide to Arm Cortex-M23 and Cortex-m33 Processors ◽

10.1016/b978-0-12-820735-2.00014-7 ◽

2021 ◽

pp. 519-557

Author(s):

Joseph Yiu

Keyword(s):

Floating Point ◽

Floating Point Unit

Download Full-text

Implementation of a 32 – bit RISC processor with floating point unit in FPGA platform

Journal of Physics Conference Series ◽

10.1088/1742-6596/1716/1/012047 ◽

2020 ◽

Vol 1716 ◽

pp. 012047

Author(s):

S Sushma ◽

Smruthi Koushika Ravindran ◽

Pavan Rajendar Nadagoudar ◽

P. Augusta Sophy

Keyword(s):

Floating Point ◽

Risc Processor ◽

Floating Point Unit

Download Full-text

Design and implementation of fast floating point units for FPGAs

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v19.i3.pp1480-1489 ◽

2020 ◽

Vol 19 (3) ◽

pp. 1480

Author(s):

Mohammed Falih Hassan ◽

Karime Farhood Hussein ◽

Bahaa Al-Musawi

Keyword(s):

Open Source ◽

Numerical Stability ◽

High Performance ◽

Low Cost ◽

Floating Point ◽

Design And Implementation ◽

Functional Block ◽

Open Source Framework ◽

Floating Point Unit ◽

Stability And Accuracy

<p>Due to growth in demand for high-performance applications that require high numerical stability and accuracy, the need for floating-point FPGA has been increased. In this work, an open-source and efficient floating-point unit is implemented on a standard Xilinx Sparton-6 FPGA platform. The proposed design is described in a hierarchal way starting from functional block descriptions toward modules level design. Our implementation used minimal resources available on the targeting FPGA board, tested on Sparton-6 FPGA platform and verified on ModelSim. The open-source framework can be embedded or customized for low-cost FPGA devices that do not offer floating-point units.</p>

Download Full-text

Design of a teaching computer with floating point unit for Computer Architecture

2020 XIV Technologies Applied to Electronics Teaching Conference (TAEE) ◽

10.1109/taee46915.2020.9163737 ◽

2020 ◽

Author(s):

Andres Gersnoviez ◽

Maria Brox ◽

Carlos Castillo-Marquez ◽

Miguel A. Montijano-Vizcaino ◽

Manuel A. Ortiz-Lopez ◽

...

Keyword(s):

Computer Architecture ◽

Floating Point ◽

Floating Point Unit

Download Full-text

A hardware system with ARM-based data processing for nano satellites

International Journal of Reconfigurable and Embedded Systems (IJRES) ◽

10.11591/ijres.v9.i2.pp102-108 ◽

2020 ◽

Vol 9 (2) ◽

pp. 102

Author(s):

Adrián Stacul ◽

Daniel Pastafiglia ◽

Ariel Di Giovanni ◽

Martín Morales ◽

Sergio Saluzzi ◽

...

Keyword(s):

Data Processing ◽

Inertial Sensors ◽

Floating Point ◽

Data Allocation ◽

Clock Frequency ◽

Flexible Design ◽

Dynamic Data ◽

Hardware System ◽

Floating Point Unit ◽

Maximum Clock Frequency

<span>The Institute of Scientiﬁc and Technical Research for Defense in Argentina (Instituto de Investigaciones Cientíﬁcas y Técnicas para la Defensa - CITEDEF) is developing a processing hardware module based on a ARM Cortex M4 processor from STMicroelectronics. The microcontroller (MCU) has the capacity to run at a maximum clock frequency of 180 MHz, integrates a Floating Point Unit (FPU). An 8MB SDRAM was included for dynamic data allocation. This hardware will host and process the algorithms to calculate and determine the nanosatellite’s attitude. The module is intended to be Cubesat compatible, possess a ﬂexible design, handles various inertial sensors and can manage backups on microSD memory cards with sizes up to 32GB.</span>

Download Full-text

floating point unit
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

AxP: A HW-SW Co-Design Pipeline for Energy-Efficient Approximated ConvNets via Associative Matching

Design and Assertion Based Verification of RISC-V Processor Subsystems

DTA-PUF: Dynamic Timing-aware Physical Unclonable Function for Resource-constrained Devices

Design and Implementation of a 32-bit Floating Point Unit

PERI

The Floating-Point Unit (FPU) in the Cortex-M33 processor

Implementation of a 32 – bit RISC processor with floating point unit in FPGA platform

Design and implementation of fast floating point units for FPGAs

Design of a teaching computer with floating point unit for Computer Architecture

A hardware system with ARM-based data processing for nano satellites

Export Citation Format

floating point unitRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

AxP: A HW-SW Co-Design Pipeline for Energy-Efficient Approximated ConvNets via Associative Matching

Design and Assertion Based Verification of RISC-V Processor Subsystems

DTA-PUF: Dynamic Timing-aware Physical Unclonable Function for Resource-constrained Devices

Design and Implementation of a 32-bit Floating Point Unit

PERI

The Floating-Point Unit (FPU) in the Cortex-M33 processor

Implementation of a 32 – bit RISC processor with floating point unit in FPGA platform

Design and implementation of fast floating point units for FPGAs

Design of a teaching computer with floating point unit for Computer Architecture

A hardware system with ARM-based data processing for nano satellites

floating point unit
Recently Published Documents