Roundoff Error and Double Precision

Double Precision Is Not Needed for Many-Body Calculations: New Conventional Wisdom

10.26434/chemrxiv.6104804.v1 ◽

2018 ◽

Author(s):

Pavel Pokhilko ◽

Evgeny Epifanovsky ◽

Anna I. Krylov

Keyword(s):

Large Scale ◽

Computation Time ◽

Coupled Cluster ◽

Double Precision ◽

Many Body ◽

Single Precision ◽

Parallel Performance ◽

Point Representation ◽

Electron Repulsion Integrals ◽

Cluster Methods

Using a single-precision floating-point representation reduces the size of data and computation time by a factor of two relative to the double precision conventionally used in electronic structure programs. For large-scale calculations, such as those encountered in many-body theories, a reduced memory footprint alleviates memory and input/output bottlenecks. The reduced size of data can lead to additional gains due to improved parallel performance on CPUs and various accelerators. However, using single precision can potentially reduce the accuracy of computed observables. Here we report an implementation of coupled-cluster and equation-of-motion coupled-cluster methods with single and double excitations in single precision. We consider both a standard implementation and one using Cholesky decomposition or resolution-of-the-identity of electron-repulsion integrals. Numerical tests illustrate that when single precision is used in correlated calculations, the loss of accuracy is insignificant, and a pure single-precision implementation can be used for computing energies, analytic gradients, excited states, and molecular properties. In addition to pure single-precision calculations, our implementation allows one to follow a single-precision calculation with clean-up iterations, fully recovering double-precision results while retaining significant savings.
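The idea of following a single-precision calculation with double-precision clean-up iterations can be illustrated on a toy problem. The sketch below is not the authors' coupled-cluster implementation. It assumes a small, made-up diagonally dominant linear system solved by Jacobi iteration, and it emulates single precision by rounding every intermediate result through `struct`. The names `f32`, `jacobi`, `A`, and `b` are hypothetical.

```python
import struct


def f32(x):
    """Round a Python float (a double) to the nearest single-precision value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def jacobi(A, b, x, tol, rnd=lambda v: v, max_iter=500):
    """Jacobi iteration; `rnd` is applied after every arithmetic operation.

    Returns the final iterate and the number of iterations performed.
    """
    n = len(b)
    for it in range(1, max_iter + 1):
        new = []
        for i in range(n):
            s = b[i]
            for j in range(n):
                if j != i:
                    s = rnd(s - rnd(A[i][j] * x[j]))
            new.append(rnd(s / A[i][i]))
        change = max(abs(p - q) for p, q in zip(new, x))
        x = new
        if change < tol:
            return x, it
    return x, max_iter


# Hypothetical test system (all entries are exactly representable in single precision).
A = [[4.0, 1.0, 0.5],
     [1.0, 5.0, 1.0],
     [0.5, 1.0, 6.0]]
b = [1.0, 2.0, 3.0]
zero = [0.0, 0.0, 0.0]

# Stage 1: iterate in emulated single precision with a loose threshold.
x_sp, it_sp = jacobi(A, b, zero, tol=1e-5, rnd=f32)

# Stage 2: clean-up iterations in double precision, started from the stage-1 result.
x_clean, it_clean = jacobi(A, b, x_sp, tol=1e-12)

# Reference: double precision from scratch.
x_dp, it_dp = jacobi(A, b, zero, tol=1e-12)

print("single-precision iterations:", it_sp)
print("double-precision clean-up iterations:", it_clean)
print("double-precision iterations from scratch:", it_dp)
print("max difference after clean-up:",
      max(abs(p - q) for p, q in zip(x_clean, x_dp)))
```

In this toy setting the clean-up stage needs fewer double-precision iterations than a double-precision run from scratch, because it starts from an already nearly converged guess. The sketch only counts iterations; it says nothing about the memory or timing savings reported in the abstract.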

Genetic improvement of data gives double precision invsqrt

Proceedings of the Genetic and Evolutionary Computation Conference Companion - GECCO '19 ◽

10.1145/3319619.3326800 ◽

2019 ◽

Author(s):

W. B. Langdon

Keyword(s):

Genetic Improvement ◽

Double Precision

A Novel Rounding Algorithm for a High Performance IEEE 754 Double-Precision Floating-Point Multiplier

2020 IEEE 38th International Conference on Computer Design (ICCD) ◽

10.1109/iccd50377.2020.00081 ◽

2020 ◽

Author(s):

S. Ross Thompson ◽

James E. Stine

Keyword(s):

High Performance ◽

Floating Point ◽

Double Precision ◽

Rounding Algorithm

Use of the SIMD-ization and unrolling for the calculation in double precision in a dynamic and unbalanced distribution of tasks on heterogeneous cluster systems

International Journal of Academic Research ◽

10.7813/2075-4124.2013/5-4/a.20 ◽

2013 ◽

Vol 5 (4) ◽

pp. 143-149

Author(s):

Cristian Andy Tănase ◽

Vasile Gheorghiţă Găitan

Keyword(s):

Double Precision ◽

Heterogeneous Cluster ◽

Cluster Systems

The Model Order Reduction Method as an Effective Way to Implement GPC Controller for Multidimensional Objects

Algorithms ◽

10.3390/a13080178 ◽

2020 ◽

Vol 13 (8) ◽

pp. 178

Author(s):

Sebastian Plamowski ◽

Richard W Kephart

Keyword(s):

Predictive Control ◽

Model Order Reduction ◽

Order Reduction ◽

High Order ◽

Double Precision ◽

Model Order ◽

Numerical Errors ◽

Multiple Input ◽

Multidimensional Objects ◽

Order Reduction Method

The paper addresses issues associated with implementing GPC controllers in systems with multiple input signals. Depending on the method of identification, the resulting models may be of high order and, when applied to a control/regulation law, may produce numerical errors due to the limitations of representing values as double-precision floating-point numbers. This phenomenon is to be avoided because, even if the model is correct, the resulting numerical errors will lead to poor control performance. An effective way to identify, and at the same time eliminate, this unfavorable feature is to reduce the model order. A method of model order reduction is presented in this paper that effectively mitigates these issues. The Generalized Predictive Control (GPC) algorithm is presented first, followed by a discussion of the conditions that result in high-order models. Examples are included in which the discussed problem is demonstrated, along with the results obtained after the reduction. The obtained results and formulated conclusions are valuable for industry practitioners who implement predictive control.

Roundoff error analysis of fast DCT algorithms in fixed point arithmetic

Numerical Algorithms ◽

10.1007/s11075-007-9123-1 ◽

2007 ◽

Vol 46 (1) ◽

pp. 1-22 ◽

Cited By ~ 1

Author(s):

Katja Ihsberner

Keyword(s):

Fixed Point ◽

Error Analysis ◽

Roundoff Error ◽

Roundoff Error Analysis ◽

Fixed Point Arithmetic ◽

Point Arithmetic

Roundoff-error reduction for evaluation of a function by polynomial approximation with error feedback in fixed-point arithmetic

IEEE Transactions on Signal Processing ◽

10.1109/78.215314 ◽

1993 ◽

Vol 41 (5) ◽

pp. 1953-1955 ◽

Cited By ~ 1

Author(s):

N. Mikami ◽

M. Kobayashi ◽

Y. Yokoyama

Keyword(s):

Fixed Point ◽

Polynomial Approximation ◽

Error Reduction ◽

Roundoff Error ◽

Error Feedback ◽

Fixed Point Arithmetic ◽

Point Arithmetic

Parallel Software to Offset the Cost of Higher Precision

ACM SIGAda Ada Letters ◽

10.1145/3463478.3463483 ◽

2021 ◽

Vol 40 (2) ◽

pp. 59-64

Author(s):

Jan Verschelde

Keyword(s):

Power Series ◽

Parallel Algorithms ◽

Series Expansions ◽

Use Case ◽

Double Precision ◽

Algebraic Space ◽

Space Curves ◽

Computational Overhead ◽

Power Series Expansions ◽

The Cost

Hardware double precision is often insufficient to solve large scientific problems accurately. Computing in higher precision defined by software causes significant computational overhead. The application of parallel algorithms compensates for this overhead. Newton's method to develop power series expansions of algebraic space curves is the use case for this application.
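To make the notion of higher precision defined in software concrete, here is a minimal sketch of double-double accumulation, a common technique that represents a value as an unevaluated sum of two doubles. It is offered as an assumption about what such software precision can look like, not as the arithmetic used in the paper. The function names `two_sum`, `dd_add`, and `dd_sum` are hypothetical.

```python
def two_sum(a, b):
    """Error-free transformation: returns (s, err) with s = fl(a + b)
    and a + b = s + err exactly (barring overflow)."""
    s = a + b
    bb = s - a
    err = (a - (s - bb)) + (b - bb)
    return s, err


def dd_add(x, y):
    """Add a double `y` to a double-double `x` = (hi, lo)."""
    s, e = two_sum(x[0], y)
    e += x[1]
    # Renormalise so that the low part stays small relative to the high part.
    return two_sum(s, e)


def dd_sum(values):
    """Accumulate doubles in double-double arithmetic."""
    acc = (0.0, 0.0)
    for v in values:
        acc = dd_add(acc, v)
    return acc


values = [1e16, 1.0, -1e16]

# Plain double-precision accumulation loses the 1.0 entirely.
naive = 0.0
for v in values:
    naive += v

hi, lo = dd_sum(values)
print("naive double sum: ", naive)
print("double-double sum:", hi + lo)
```

Each `dd_add` performs roughly a dozen floating-point operations in place of one hardware addition, which gives a feel for the computational overhead the abstract refers to and for why parallelism is attractive as a way to offset it.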

Attitude reconstruction from strap-down rate gyros using power series

Journal of Navigation ◽

10.1017/s0373463321000023 ◽

2021 ◽

pp. 1-19

Author(s):

Habib Ghanbarpourasl

Keyword(s):

Power Series ◽

Taylor Series ◽

Direction Cosine ◽

Analytical Description ◽

Double Precision ◽

Angular Velocity Vector ◽

Higher Order Terms ◽

Direction Cosine Matrix ◽

The Stability ◽

Better Than

This paper introduces a power-series-based method for attitude reconstruction from an orthogonal triad of strap-down gyros. The method is implemented and validated using quaternions and the direction cosine matrix, in both single- and double-precision implementations. It is supposed that data from the gyros are sampled at high frequency and that a fitted polynomial is used for an analytical description of the angular velocity vector. The method is compared with the well-known Taylor series approach, and the stability of the coefficients' norm in higher-order terms is analysed for both methods. It is shown that the norm of the quaternions' derivatives in the Taylor series is larger than the coefficients of the equivalent terms in the power series. In the proposed method, more terms can be used in the power series before the coefficients saturate, and the error of the proposed method is less than that of other methods. The numerical results show that the proposed method with quaternions performs better than other methods. The method is robust with respect to sensor noise and has a low computational load compared with other methods.

Certified Roundoff Error Bounds Using Bernstein Expansions and Sparse Krivine-Stengle Representations

2017 IEEE 24th Symposium on Computer Arithmetic (ARITH) ◽

10.1109/arith.2017.36 ◽

2017 ◽

Cited By ~ 3

Author(s):

Alexandre Rocca ◽

Victor Magron ◽

Thao Dang

Keyword(s):

Error Bounds ◽

Roundoff Error
