Table-free Seed Generation for Hardware Newton–Raphson Square Root and Inverse Square Root Implementations in IoT Devices

A Givens rotation based scalable QRD core which utilizes an efficient pipelined and unfolded 2D multiply and accumulate (MAC) based systolic array architecture with dynamic partial reconfiguration (DPR) capability is proposed. The square root and inverse square root operations in the Givens rotation algorithm are handled using a modified look-up table (LUT) based Newton-Raphson method, thereby reducing the area by 71% and latency by 50% while operating at a frequency 49% higher than the existing boundary cell architectures. The proposed architecture is implemented on Xilinx Virtex-6 FPGA for any real matrices of sizem×n, where4≤n≤8andm≥nby dynamically inserting or removing the partial modules. The evaluation results demonstrate a significant reduction in latency, area, and power as compared to other existing architectures. The functionality of the proposed core is evaluated for a variable length adaptive equalizer.

Download Full-text

Optimal absolute error starting values for Newton-Raphson calculation of square root

Computing ◽

10.1007/bf02239012 ◽

1991 ◽

Vol 46 (1) ◽

pp. 67-86 ◽

Cited By ~ 5

Author(s):

P. Montuschi ◽

M. Mezzalama

Keyword(s):

Absolute Error ◽

Square Root ◽

Starting Values ◽

Newton Raphson

Download Full-text

Posit Arithmetic Hardware Implementations with The Minimum Cost Divider and SquareRoot

Electronics ◽

10.3390/electronics9101622 ◽

2020 ◽

Vol 9 (10) ◽

pp. 1622

Author(s):

Feibao Xiao ◽

Feng Liang ◽

Bin Wu ◽

Junzhe Liang ◽

Shuting Cheng ◽

...

Keyword(s):

Hardware Implementation ◽

Minimum Cost ◽

Number System ◽

Floating Point ◽

Square Root ◽

Subtraction Method ◽

Embedded Devices ◽

Hardware Implementations ◽

Newton Raphson ◽

Raphson Method

As a substitute for the IEEE 754-2008 floating-point standard, Posit, a new kind of number system for floating-point numbers, was put forward recently. Hitherto, some studies have proven that Posit is a better floating-point style than IEEE 754-2008 in some fields. However, most of these studies presented the advantages of Posit from the arithmetical aspect, but none of them suggested it had a better hardware implementation than that of IEEE 754-2008. In this paper, we propose several hardware implementations that contain the Posit adder/subtractor, multiplier, divider, and square root. Our goal is to achieve an arbitrary Posit format and exploit the minimum circuit area, which is required in embedded devices. To implement the minimum circuit area for the divider and square root, the alternating addition and subtraction method is used rather than the Newton–Raphson method. Compared with other works, the area of our divider is about 0.2×–0.7× (FPGA). Furthermore, this paper provides the synthesis results for each critical module with the Xilinx Virtex-7 FPGA VC709 platform.

Download Full-text

A scaling-less Newton-Raphson pipelined implementation for a fixed-point inverse square root operator

2017 15th IEEE International New Circuits and Systems Conference (NEWCAS) ◽

10.1109/newcas.2017.8010129 ◽

2017 ◽

Cited By ~ 2

Author(s):

Erwan Libessart ◽

Matthieu Arzel ◽

Cyril Lahuec ◽

Francesco Andriulli

Keyword(s):

Fixed Point ◽

Square Root ◽

Newton Raphson ◽

Root Operator

Download Full-text

A Modification of the Fast Inverse Square Root Algorithm

Computation ◽

10.3390/computation7030041 ◽

2019 ◽

Vol 7 (3) ◽

pp. 41 ◽

Cited By ~ 1

Author(s):

Cezary J. Walczyk ◽

Leonid V. Moroz ◽

Jan L. Cieśliński

Keyword(s):

Analytical Approach ◽

Floating Point ◽

Square Root ◽

Approximate Evaluation ◽

Single Precision ◽

Seed Solution ◽

Numerical Tests ◽

Newton Raphson ◽

Relative Errors ◽

Magic Constant

We present a new algorithm for the approximate evaluation of the inverse square root for single-precision floating-point numbers. This is a modification of the famous fast inverse square root code. We use the same “magic constant” to compute the seed solution, but then, we apply Newton–Raphson corrections with modified coefficients. As compared to the original fast inverse square root code, the new algorithm is two-times more accurate in the case of one Newton–Raphson correction and almost seven-times more accurate in the case of two corrections. We discuss relative errors within our analytical approach and perform numerical tests of our algorithm for all numbers of the type float.

Download Full-text

Genetic Improvement of Data for Maths Functions

ACM Transactions on Evolutionary Learning and Optimization ◽

10.1145/3461016 ◽

2021 ◽

Vol 1 (2) ◽

pp. 1-30

Author(s):

William B. Langdon ◽

Oliver Krauss

Keyword(s):

Open Source ◽

Genetic Improvement ◽

Root Function ◽

Cube Root ◽

Double Precision ◽

Square Root ◽

Smart Dust ◽

Code Changes ◽

Newton Raphson ◽

Binary Logarithm

We use continuous optimisation and manual code changes to evolve up to 1024 Newton-Raphson numerical values embedded in an open source GNU C library glibc square root sqrt to implement a double precision cube root routine cbrt, binary logarithm log2 and reciprocal square root function for C in seconds. The GI inverted square root x -1/2 is far more accurate than Quake’s InvSqrt, Quare root. GI shows potential for automatically creating mobile or low resource mote smart dust bespoke custom mathematical libraries with new functionality.

Download Full-text

Improving the Accuracy of the Fast Inverse Square Root by Modifying Newton–Raphson Corrections

Entropy ◽

10.3390/e23010086 ◽

2021 ◽

Vol 23 (1) ◽

pp. 86

Author(s):

Cezary J. Walczyk ◽

Leonid V. Moroz ◽

Jan L. Cieśliński

Keyword(s):

Storage Capacity ◽

Root Function ◽

Low Complexity ◽

Double Precision ◽

Square Root ◽

Direct Computation ◽

Single Precision ◽

Fast Calculation ◽

Computational Costs ◽

Newton Raphson

Direct computation of functions using low-complexity algorithms can be applied both for hardware constraints and in systems where storage capacity is a challenge for processing a large volume of data. We present improved algorithms for fast calculation of the inverse square root function for single-precision and double-precision floating-point numbers. Higher precision is also discussed. Our approach consists in minimizing maximal errors by finding optimal magic constants and modifying the Newton–Raphson coefficients. The obtained algorithms are much more accurate than the original fast inverse square root algorithm and have similar very low computational costs.

Download Full-text

Correctness proofs outline for Newton-Raphson based floating-point divide and square root algorithms

Proceedings 14th IEEE Symposium on Computer Arithmetic (Cat. No.99CB36336) ◽

10.1109/arith.1999.762834 ◽

2003 ◽

Cited By ~ 28

Author(s):

M.A. Cornea-Hasegan ◽

R.A. Golliver ◽

P. Markstein

Keyword(s):

Floating Point ◽

Square Root ◽

Correctness Proofs ◽

Newton Raphson

Download Full-text

A Modification of the Fast Inverse Square Root Algorithm

10.20944/preprints201908.0045.v1 ◽

2019 ◽

Author(s):

Cezary J. Walczyk ◽

Leonid V. Moroz ◽

Jan L. Cieśliński

Keyword(s):

Computational Cost ◽

Floating Point ◽

Square Root ◽

Single Precision ◽

Fast Calculation ◽

Newton Raphson ◽

Floating Point Numbers ◽

Improved Algorithm

We present an improved algorithm for fast calculation of the inverse square root for single-precision floating-point numbers. The algorithm is much more accurate than the famous fast inverse square root algorithm and has a similar computational cost. The presented modification concern Newton-Raphson corrections and can be applied when the distribution of these corrections is not symmetric (for instance, in our case they are always negative).

Download Full-text

Decimal Floating-Point Square Root Using Newton-Raphson Iteration

2005 IEEE International Conference on Application-Specific Systems, Architecture Processors (ASAP'05) ◽

10.1109/asap.2005.29 ◽

2006 ◽

Cited By ~ 7

Author(s):

Liang-Kai Wang ◽

M.J. Schulte

Keyword(s):

Floating Point ◽

Square Root ◽

Newton Raphson ◽

Raphson Iteration

Download Full-text