multiple precision Latest Research Papers

Owing to the failure of Dennard’s scaling, the past decade has seen a steep growth of prominent new paradigms leveraging opportunities in computer architecture. Two technologies of interest are Posit and RISC-V. Posit was introduced in mid-2017 as a viable alternative to IEEE-754, and RISC-V provides a commercial-grade open-source Instruction Set Architecture (ISA). In this article, we bring these two technologies together and propose a Configurable Posit Enabled RISC-V Core called PERI. The article provides insights on how the Single-Precision Floating Point (“F”) extension of RISC-V can be leveraged to support posit arithmetic. We also present the implementation details of a parameterized and feature-complete posit Floating Point Unit (FPU). The configurability and parameterization features of this unit generate optimal hardware that caters to the accuracy and energy/area tradeoffs imposed by the applications, a feature not possible with an IEEE-754 implementation. The posit FPU has been integrated with the RISC-V compliant SHAKTI C-class core as an execution unit. To further leverage the potential of posit, we enhance our posit FPU to support two different exponent sizes (with the posit size being 32 bits), thereby enabling multiple precision at runtime. To enable the compilation and execution of C programs on PERI, we have made minimal modifications to the GNU C Compiler (GCC), targeting the “F” extension of RISC-V. We compare posit with IEEE-754 in terms of hardware area, application accuracy, and runtime. We also present an alternate methodology of integrating the posit FPU with the RISC-V core as an accelerator using the custom opcode space of RISC-V.
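As a rough illustration of what a runtime-selectable exponent size means for a 32-bit posit, the sketch below decodes the same bit pattern under two values of `es`. The function name `decode_posit` and the values `es = 2` and `es = 3` are assumptions made for this example only: the abstract does not say which two exponent sizes PERI supports, and this is a software decoder following the general posit format, not the paper's hardware FPU.

```python
from fractions import Fraction
from typing import Optional


def decode_posit(bits: int, n: int = 32, es: int = 2) -> Optional[Fraction]:
    """Decode an n-bit posit with es exponent bits into an exact Fraction.

    Returns None for NaR (the single "not a real" pattern).
    Hypothetical helper for illustration; not taken from the PERI paper.
    """
    mask = (1 << n) - 1
    bits &= mask
    if bits == 0:
        return Fraction(0)
    if bits == 1 << (n - 1):
        return None  # NaR

    negative = (bits >> (n - 1)) == 1
    if negative:
        bits = (-bits) & mask  # negative posits are stored in two's complement

    body = bits & ((1 << (n - 1)) - 1)  # the n-1 bits after the sign

    # Regime: a run of identical bits, ended by the opposite bit (or the end).
    pos = n - 2
    first = (body >> pos) & 1
    run = 0
    while pos >= 0 and ((body >> pos) & 1) == first:
        run += 1
        pos -= 1
    k = run - 1 if first == 1 else -run
    pos -= 1  # skip the terminating regime bit, if present

    # Whatever is left holds up to es exponent bits, then the fraction.
    remaining = max(pos + 1, 0)
    tail = body & ((1 << remaining) - 1)
    exp_bits = min(es, remaining)
    frac_bits = remaining - exp_bits
    exponent = (tail >> frac_bits) << (es - exp_bits)  # missing bits count as 0
    fraction = tail & ((1 << frac_bits) - 1)

    scale = k * (1 << es) + exponent  # value = 2**scale * (1 + f)
    significand = 1 + Fraction(fraction, 1 << frac_bits)
    value = significand * Fraction(2) ** scale
    return -value if negative else value


# The same 32-bit pattern read with two exponent sizes.
print(decode_posit(0x48000000, es=2))  # 2
print(decode_posit(0x48000000, es=3))  # 4
```

A larger `es` widens the dynamic range at the cost of fraction bits, which is the accuracy versus range tradeoff that a runtime-switchable exponent size exposes.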


A Reconfigurable Multiple-Precision Floating-Point Dot Product Unit for High-Performance Computing

10.23919/date51398.2021.9473928 ◽ 2021

Author(s): Wei Mao ◽ Kai Li ◽ Xinang Xie ◽ Shirui Zhao ◽ He Li ◽ ...

Keyword(s): High Performance Computing ◽ High Performance ◽ Floating Point ◽ Multiple Precision ◽ Dot Product ◽ Performance Computing


Implementation of Multiple Precision Sparse Matrix-vector Multiplication on CUDA using ELLPACK Format

Journal of Physics Conference Series ◽ 10.1088/1742-6596/1828/1/012013 ◽ 2021 ◽ Vol 1828 (1) ◽ pp. 012013

Author(s): Konstantin Isupov ◽ Ivan Babeshko ◽ Alexander Krutikov

Keyword(s): Sparse Matrix ◽ Matrix Vector Multiplication ◽ Multiple Precision ◽ Matrix Vector


High-Performance Computation in Residue Number System Using Floating-Point Arithmetic

Computation ◽ 10.3390/computation9020009 ◽ 2021 ◽ Vol 9 (2) ◽ pp. 9

Author(s): Konstantin Isupov

Keyword(s): Graphics Processing Units ◽ High Performance ◽ Dynamic Range ◽ Practical Interest ◽ Number System ◽ Residue Number System ◽ Floating Point ◽ Mixed Radix Conversion ◽ Multiple Precision ◽ Residue Number

The residue number system (RNS) is known for its parallel arithmetic and has been used in recent decades in various important applications, from digital signal processing and deep neural networks to cryptography and high-precision computation. However, comparison, sign identification, overflow detection, and division are still hard to implement in RNS. For such operations, most of the methods proposed in the literature support only small dynamic ranges (up to several tens of bits), so they are suitable only for low-precision applications. We recently proposed a method that supports arbitrary moduli sets with cryptographically sized dynamic ranges, up to several thousands of bits. The practical interest of our method compared to existing methods is that it relies only on very fast standard floating-point operations, so it is suitable for multiple-precision applications and can be efficiently implemented on many general-purpose platforms that support IEEE 754 arithmetic. In this paper, we make further improvements to this method and demonstrate that it can successfully be applied to implement efficient data-parallel primitives operating in the RNS domain, namely finding the maximum element of an array of RNS numbers on graphics processing units. Our experimental results on an NVIDIA RTX 2080 GPU show that for random residues and a 128-moduli set with 2048-bit dynamic range, the proposed implementation reduces the running time by a factor of 39 and the memory consumption by a factor of 13 compared to an implementation based on mixed-radix conversion.
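To make the contrast in the abstract concrete, the following minimal sketch shows why addition and multiplication are easy in RNS (each residue is handled independently) while finding a maximum is not: here it falls back on mixed-radix conversion, the baseline the abstract compares against. The tiny moduli set and all function names are assumptions chosen for illustration. This is plain Python rather than a GPU implementation, and it does not reproduce the paper's floating-point method.

```python
from math import prod

# Illustrative pairwise-coprime moduli; dynamic range is 7*11*13*15 = 15015.
MODULI = (7, 11, 13, 15)


def to_rns(x, moduli=MODULI):
    """Represent a non-negative integer by its residues."""
    return tuple(x % m for m in moduli)


def rns_add(a, b, moduli=MODULI):
    """Digit-wise addition: no carries cross between residues."""
    return tuple((x + y) % m for x, y, m in zip(a, b, moduli))


def rns_mul(a, b, moduli=MODULI):
    """Digit-wise multiplication, equally independent per residue."""
    return tuple((x * y) % m for x, y, m in zip(a, b, moduli))


def mixed_radix_digits(r, moduli=MODULI):
    """Mixed-radix conversion: x = d0 + d1*m0 + d2*m0*m1 + ...

    Inherently sequential, since each digit depends on the previous ones.
    """
    residues = list(r)
    digits = []
    for i, m_i in enumerate(moduli):
        d = residues[i]
        digits.append(d)
        for j in range(i + 1, len(moduli)):
            m_j = moduli[j]
            inv = pow(m_i, -1, m_j)  # modular inverse (Python 3.8+)
            residues[j] = ((residues[j] - d) * inv) % m_j
    return digits


def from_mixed_radix(digits, moduli=MODULI):
    """Rebuild the integer from its mixed-radix digits (for checking only)."""
    return sum(d * prod(moduli[:i]) for i, d in enumerate(digits))


def rns_max(numbers, moduli=MODULI):
    """Maximum of RNS numbers, ordering them by mixed-radix digits,
    most significant digit first."""
    return max(numbers,
               key=lambda r: tuple(reversed(mixed_radix_digits(r, moduli))))


values = [1234, 15014, 42, 9000]
largest = rns_max([to_rns(v) for v in values])
print(from_mixed_radix(mixed_radix_digits(largest)))  # 15014
```

The arithmetic helpers touch each residue once and in isolation, whereas `mixed_radix_digits` needs a quadratic number of dependent steps per number, which is the kind of cost a comparison-friendly method would aim to avoid.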


Estimating Multiple Precision Matrices with Cluster Fusion Regularization

Journal of Computational and Graphical Statistics ◽ 10.1080/10618600.2021.1874963 ◽ 2021 ◽ pp. 1-30

Author(s): Bradley S. Price ◽ Aaron J. Molstad ◽ Ben Sherwood

Keyword(s): Multiple Precision


A Multiple-Precision Multiply and Accumulation Design with Multiply-Add Merged Strategy for AI Accelerating

Proceedings of the 26th Asia and South Pacific Design Automation Conference ◽ 10.1145/3394885.3431531 ◽ 2021

Author(s): Song Zhang ◽ Jiangyuan Gu ◽ Shouyi Yin ◽ Leibo Liu ◽ Shaojun Wei

Keyword(s): Multiple Precision


Multiple-Precision Arithmetic of Biot-Savart Integrals for Reconnections of Vortex Filaments

10.1007/978-3-030-86976-2_13 ◽ 2021 ◽ pp. 191-201

Author(s): Yu-Hsun Lee ◽ Hiroshi Fujiwara

Keyword(s): Vortex Filaments ◽ Multiple Precision ◽ Multiple Precision Arithmetic


A Configurable Floating-Point Multiple-Precision Processing Element for HPC and AI Converged Computing

IEEE Transactions on Very Large Scale Integration (VLSI) Systems ◽ 10.1109/tvlsi.2021.3128435 ◽ 2021 ◽ pp. 1-14

Author(s): Wei Mao ◽ Kai Li ◽ Quan Cheng ◽ Liuyao Dai ◽ Boyu Li ◽ ...

Keyword(s): Processing Element ◽ Floating Point ◽ Multiple Precision


multiple precision
Recently Published Documents


Implementation of Coprocessor for Integer Multiple Precision Arithmetic on Zynq Ultrascale+ MPSoC

Efficient Multiple-Precision Posit Multiplier

PERI

A Reconfigurable Multiple-Precision Floating-Point Dot Product Unit for High-Performance Computing

Implementation of Multiple Precision Sparse Matrix-vector Multiplication on CUDA using ELLPACK Format

High-Performance Computation in Residue Number System Using Floating-Point Arithmetic

Estimating Multiple Precision Matrices with Cluster Fusion Regularization

A Multiple-Precision Multiply and Accumulation Design with Multiply-Add Merged Strategy for AI Accelerating

Multiple-Precision Arithmetic of Biot-Savart Integrals for Reconnections of Vortex Filaments

A Configurable Floating-Point Multiple-Precision Processing Element for HPC and AI Converged Computing
