electron repulsion integrals Latest Research Papers

<div><div><div><p>We report a new multi-GPU capable ab initio Hartree-Fock/density functional theory implementation integrated into the open source QUantum Interaction Computational Kernel (QUICK) program. Details on the load balancing algorithms for electron repulsion integrals and exchange correlation quadrature across multiple GPUs are described. Benchmarking studies carried out on up to 4 GPU nodes, each containing 4 NVIDIA V100-SMX2 type GPUs demonstrate that our implementation is capable of achiev- ing excellent load balancing and high parallel efficiency. For representative medium to large size protein/organic molecular sys- tems, the observed efficiencies remained above 86%. The accelerations on NVIDIA A100, P100 and K80 platforms also have real- ized parallel efficiencies higher than 74%, paving the way for large-scale ab initio electronic structure calculations.</p></div></div></div>

Download Full-text

Harnessing the Power of Multi-GPU Acceleration into the Quantum Interaction Computational Kernel Program

10.26434/chemrxiv.13769209.v1 ◽

2021 ◽

Author(s):

Madushanka Manathunga ◽

Chi Jin ◽

Vinicius Cruzeiro ◽

Yipu Miao ◽

Dawei Mu ◽

...

Keyword(s):

Load Balancing ◽

Ab Initio ◽

Density Functional ◽

Large Scale ◽

Functional Theory ◽

Multiple Gpus ◽

Hartree Fock ◽

Large Size ◽

Electron Repulsion Integrals ◽

Computational Kernel

<div><div><div><p>We report a new multi-GPU capable ab initio Hartree-Fock/density functional theory implementation integrated into the open source QUantum Interaction Computational Kernel (QUICK) program. Details on the load balancing algorithms for electron repulsion integrals and exchange correlation quadrature across multiple GPUs are described. Benchmarking studies carried out on up to 4 GPU nodes, each containing 4 NVIDIA V100-SMX2 type GPUs demonstrate that our implementation is capable of achiev- ing excellent load balancing and high parallel efficiency. For representative medium to large size protein/organic molecular sys- tems, the observed efficiencies remained above 86%. The accelerations on NVIDIA A100, P100 and K80 platforms also have real- ized parallel efficiencies higher than 74%, paving the way for large-scale ab initio electronic structure calculations.</p></div></div></div>

Download Full-text

Evaluating two-electron-repulsion integrals over arbitrary orbitals using zero variance Monte Carlo: Application to full configuration interaction calculations with Slater-type orbitals

The Journal of Chemical Physics ◽

10.1063/1.5114703 ◽

2019 ◽

Vol 151 (6) ◽

pp. 064101

Author(s):

Michel Caffarel

Keyword(s):

Monte Carlo ◽

Configuration Interaction ◽

Slater Type Orbitals ◽

Slater Type ◽

Full Configuration Interaction ◽

Electron Repulsion ◽

Electron Repulsion Integrals ◽

Configuration Interaction Calculations

Download Full-text

Implementation of analytic gradients for CCSD and EOM-CCSD using Cholesky decomposition of the electron-repulsion integrals and their derivatives: Theory and benchmarks

The Journal of Chemical Physics ◽

10.1063/1.5100022 ◽

2019 ◽

Vol 151 (1) ◽

pp. 014110 ◽

Cited By ~ 4

Author(s):

Xintian Feng ◽

Evgeny Epifanovsky ◽

Jürgen Gauss ◽

Anna I. Krylov

Keyword(s):

Cholesky Decomposition ◽

Electron Repulsion ◽

Electron Repulsion Integrals

Download Full-text

An efficient algorithm for Cholesky decomposition of electron repulsion integrals

The Journal of Chemical Physics ◽

10.1063/1.5083802 ◽

2019 ◽

Vol 150 (19) ◽

pp. 194112 ◽

Cited By ~ 12

Author(s):

Sarai D. Folkestad ◽

Eirik F. Kjønstad ◽

Henrik Koch

Keyword(s):

Efficient Algorithm ◽

Cholesky Decomposition ◽

Electron Repulsion ◽

Electron Repulsion Integrals

Download Full-text

The static parallel distribution algorithms for hybrid density-functional calculations in HONPAS package

The International Journal of High Performance Computing Applications ◽

10.1177/1094342019845046 ◽

2019 ◽

Vol 34 (2) ◽

pp. 159-168

Author(s):

Xinming Qin ◽

Honghui Shang ◽

Lei Xu ◽

Wei Hu ◽

Jinlong Yang ◽

...

Keyword(s):

Distributed Algorithms ◽

Density Functional ◽

Materials Science ◽

Parallel Implementation ◽

Computational Cost ◽

Ab Initio Simulations ◽

Communication Time ◽

Electron Repulsion Integrals ◽

Distribution Algorithms ◽

Hybrid Density Functional

Hybrid density-functional calculation is one of the most commonly adopted electronic structure theories in computational chemistry and materials science because of its balance between accuracy and computational cost. Recently, we have developed a novel scheme called NAO2GTO to achieve linear scaling (Order-N) calculations for hybrid density-functionals. In our scheme, the most time-consuming step is the calculation of the electron repulsion integrals (ERIs) part, so creating an even distribution of these ERIs in parallel implementation is an issue of particular importance. Here, we present two static scalable distributed algorithms for the ERIs computation. Firstly, the ERIs are distributed over ERIs shell pairs. Secondly, the ERIs are distributed over ERIs shell quartets. In both algorithms, the calculation of ERIs is independent of each other, so the communication time is minimized. We show our speedup results to demonstrate the performance of these static parallel distributed algorithms in the Hefei Order-N packages for ab initio simulations.

Download Full-text

Double Precision Is Not Needed for Many-Body Calculations: New Conventional Wisdom

10.26434/chemrxiv.6104804.v1 ◽

2018 ◽

Author(s):

Pavel Pokhilko ◽

Evgeny Epifanovsky ◽

Anna I. Krylov

Keyword(s):

Large Scale ◽

Computation Time ◽

Coupled Cluster ◽

Double Precision ◽

Many Body ◽

Single Precision ◽

Parallel Performance ◽

Point Representation ◽

Electron Repulsion Integrals ◽

Cluster Methods

Using single precision floating point representation reduces the size of data and computation time by a factor of two relative to double precision conventionally used in electronic structure programs. For large-scale calculations, such as those encountered in many-body theories, reduced memory footprint alleviates memory and input/output bottlenecks. Reduced size of data can lead to additional gains due to improved parallel performance on CPUs and various accelerators. However, using single precision can potentially reduce the accuracy of computed observables. Here we report an implementation of coupled-cluster and equation-of-motion coupled-cluster methods with single and double excitations in single precision. We consider both standard implementation and one using Cholesky decomposition or resolution-of-the-identity of electron-repulsion integrals. Numerical tests illustrate that when single precision is used in correlated calculations, the loss of accuracy is insignificant and pure single-precision implementation can be used for computing energies, analytic gradients, excited states, and molecular properties. In addition to pure single-precision calculations, our implementation allows one to follow a single-precision calculation by clean-up iterations, fully recovering double-precision results while retaining significant savings.

Download Full-text

Double Precision Is Not Needed for Many-Body Calculations: New Conventional Wisdom

10.26434/chemrxiv.6104804 ◽

2018 ◽

Author(s):

Pavel Pokhilko ◽

Evgeny Epifanovsky ◽

Anna I. Krylov

Keyword(s):

Large Scale ◽

Computation Time ◽

Coupled Cluster ◽

Double Precision ◽

Many Body ◽

Single Precision ◽

Parallel Performance ◽

Point Representation ◽

Electron Repulsion Integrals ◽

Cluster Methods

Using single precision floating point representation reduces the size of data and computation time by a factor of two relative to double precision conventionally used in electronic structure programs. For large-scale calculations, such as those encountered in many-body theories, reduced memory footprint alleviates memory and input/output bottlenecks. Reduced size of data can lead to additional gains due to improved parallel performance on CPUs and various accelerators. However, using single precision can potentially reduce the accuracy of computed observables. Here we report an implementation of coupled-cluster and equation-of-motion coupled-cluster methods with single and double excitations in single precision. We consider both standard implementation and one using Cholesky decomposition or resolution-of-the-identity of electron-repulsion integrals. Numerical tests illustrate that when single precision is used in correlated calculations, the loss of accuracy is insignificant and pure single-precision implementation can be used for computing energies, analytic gradients, excited states, and molecular properties. In addition to pure single-precision calculations, our implementation allows one to follow a single-precision calculation by clean-up iterations, fully recovering double-precision results while retaining significant savings.

Download Full-text

An efficient MPI/OpenMP parallelization of the Hartree–Fock–Roothaan method for the first generation of Intel® Xeon Phi™ processor architecture

The International Journal of High Performance Computing Applications ◽

10.1177/1094342017732628 ◽

2017 ◽

Vol 33 (1) ◽

pp. 212-224 ◽

Cited By ~ 5

Author(s):

Vladimir Mironov ◽

Alexander Moskovsky ◽

Michael D’Mello ◽

Yuri Alexeev

Keyword(s):

Message Passing ◽

Message Passing Interface ◽

First Generation ◽

Hot Spot ◽

Direct Consequence ◽

Xeon Phi ◽

Hartree Fock ◽

Self Consistent Field ◽

Speed Up ◽

Electron Repulsion Integrals

The Hartree–Fock method in the General Atomic and Molecular Structure System (GAMESS) quantum chemistry package represents one of the most irregular algorithms in computation today. Major steps in the calculation are the irregular computation of electron repulsion integrals and the building of the Fock matrix. These are the central components of the main self consistent field (SCF) loop, the key hot spot in electronic structure codes. By threading the Message Passing Interface (MPI) ranks in the official release of the GAMESS code, we not only speed up the main SCF loop (4× to 6× for large systems) but also achieve a significant ([Formula: see text]×) reduction in the overall memory footprint. These improvements are a direct consequence of memory access optimizations within the MPI ranks. We benchmark our implementation against the official release of the GAMESS code on the Intel® Xeon Phi™ supercomputer. Scaling numbers are reported on up to 7680 cores on Intel Xeon Phi coprocessors.

Download Full-text

electron repulsion integrals
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Toward the Minimal Floating Operation Count Cholesky Decomposition of Electron Repulsion Integrals

Harnessing the Power of Multi-GPU Acceleration into the Quantum Interaction Computational Kernel Program

Harnessing the Power of Multi-GPU Acceleration into the Quantum Interaction Computational Kernel Program

Evaluating two-electron-repulsion integrals over arbitrary orbitals using zero variance Monte Carlo: Application to full configuration interaction calculations with Slater-type orbitals

Implementation of analytic gradients for CCSD and EOM-CCSD using Cholesky decomposition of the electron-repulsion integrals and their derivatives: Theory and benchmarks

An efficient algorithm for Cholesky decomposition of electron repulsion integrals

The static parallel distribution algorithms for hybrid density-functional calculations in HONPAS package

Double Precision Is Not Needed for Many-Body Calculations: New Conventional Wisdom

Double Precision Is Not Needed for Many-Body Calculations: New Conventional Wisdom

An efficient MPI/OpenMP parallelization of the Hartree–Fock–Roothaan method for the first generation of Intel® Xeon Phi™ processor architecture

Export Citation Format

electron repulsion integralsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Toward the Minimal Floating Operation Count Cholesky Decomposition of Electron Repulsion Integrals

Harnessing the Power of Multi-GPU Acceleration into the Quantum Interaction Computational Kernel Program

Harnessing the Power of Multi-GPU Acceleration into the Quantum Interaction Computational Kernel Program

Evaluating two-electron-repulsion integrals over arbitrary orbitals using zero variance Monte Carlo: Application to full configuration interaction calculations with Slater-type orbitals

Implementation of analytic gradients for CCSD and EOM-CCSD using Cholesky decomposition of the electron-repulsion integrals and their derivatives: Theory and benchmarks

An efficient algorithm for Cholesky decomposition of electron repulsion integrals

The static parallel distribution algorithms for hybrid density-functional calculations in HONPAS package

Double Precision Is Not Needed for Many-Body Calculations: New Conventional Wisdom

Double Precision Is Not Needed for Many-Body Calculations: New Conventional Wisdom

An efficient MPI/OpenMP parallelization of the Hartree–Fock–Roothaan method for the first generation of Intel® Xeon Phi™ processor architecture

electron repulsion integrals
Recently Published Documents