scholarly journals Dual tree traversal on integrated GPUs for astrophysical N-body simulations

Author(s):  
Pierre Fortin ◽  
Maxime Touche

In astrophysical N-body simulations, O( N) fast multipole methods (FMMs) with dual tree traversal (DTT) on multi-core CPUs are faster than O( N log N) CPU tree-codes but can still be outperformed by GPU ones. In this article, we aim at combining the best algorithm, namely FMM with DTT, with the most powerful hardware currently available, namely GPUs. In the astrophysical context requiring low accuracies and non-uniform particle distributions, we show that such combination can be achieved thanks to a hybrid CPU-GPU algorithm on integrated GPUs: while the DTT is performed on the CPU cores, the far- and near-field computations are all performed on the GPU cores. We show how to efficiently expose the interactions resulting from the DTT to the GPU cores, how to deploy both the far- and near-field computations on GPU, and how to overlap the parallel DTT on CPU with GPU computations. Based on the falcON code and using OpenCL on AMD Accelerated Processing Units and on Intel integrated GPUs, this first heterogeneous deployment of DTT for FMM outperforms standard multi-core CPUs and matches GPU and high-end CPU performance, being hence more cost- and power-efficient.

2013 ◽  
Vol 13 (1) ◽  
pp. 107-128 ◽  
Author(s):  
Bo Zhang ◽  
Benzhuo Lu ◽  
Xiaolin Cheng ◽  
Jingfang Huang ◽  
Nikos P. Pitsianis ◽  
...  

AbstractThis paper summarizes the mathematical and numerical theories and computational elements of the adaptive fast multipole Poisson-Boltzmann (AFMPB) solver. We introduce and discuss the following components in order: the Poisson-Boltzmann model, boundary integral equation reformulation, surface mesh generation, the nodepatch discretization approach, Krylov iterative methods, the new version of fast multipole methods (FMMs), and a dynamic prioritization technique for scheduling parallel operations. For each component, we also remark on feasible approaches for further improvements in efficiency, accuracy and applicability of the AFMPB solver to large-scale long-time molecular dynamics simulations. The potential of the solver is demonstrated with preliminary numerical results.


2006 ◽  
Vol 32 (10-11) ◽  
pp. 775-790 ◽  
Author(s):  
J. Kurzak ◽  
B. M. Pettitt

2017 ◽  
Vol 62 (8) ◽  
pp. 400-402
Author(s):  
A. M. Linkov ◽  
E. Rejwer ◽  
L. Rybarska-Rusinek

1998 ◽  
Vol 5 (3) ◽  
pp. 32-38 ◽  
Author(s):  
L. Greengard ◽  
Jingfang Huang ◽  
V. Rokhlin ◽  
S. Wandzura

Sign in / Sign up

Export Citation Format

Share Document