scholarly journals FMM-Yukawa: An adaptive fast multipole method for screened Coulomb interactions

2009 ◽  
Vol 180 (11) ◽  
pp. 2331-2338 ◽  
Author(s):  
Jingfang Huang ◽  
Jun Jia ◽  
Bo Zhang
2010 ◽  
Vol 181 (12) ◽  
pp. 2206-2207 ◽  
Author(s):  
Bo Zhang ◽  
Jingfang Huang ◽  
Nikos P. Pitsianis ◽  
Xiaobai Sun

2016 ◽  
Vol 20 (2) ◽  
pp. 534-550 ◽  
Author(s):  
Bo Zhang ◽  
Jingfang Huang ◽  
Nikos P. Pitsianis ◽  
Xiaobai Sun

AbstractWe present RECFMM, a program representation and implementation of a recursive scheme for parallelizing the adaptive fast multipole method (FMM) on shared-memory computers. It achieves remarkable high performance while maintaining mathematical clarity and flexibility. The parallelization scheme signifies the recursion feature that is intrinsic to the FMM but was not well exploited. The program modules of RECFMM constitute a map between numerical computation components and advanced architecture mechanisms. The mathematical structure is preserved and exploited, not obscured nor compromised, by parallel rendition of the recursion scheme. Modern software system—CILK in particular, which provides graph-theoretic optimal scheduling in adaptation to the dynamics in parallel execution—is employed. RECFMM supports multiple algorithm variants that mark the major advances with low-frequency interaction kernels, and includes the asymmetrical version where the source particle ensemble is not necessarily the same as the target particle ensemble. We demonstrate parallel performance with Coulomb and screened Coulomb interactions.


2011 ◽  
Vol 230 (15) ◽  
pp. 5807-5821 ◽  
Author(s):  
Bo Zhang ◽  
Jingfang Huang ◽  
Nikos P. Pitsianis ◽  
Xiaobai Sun

1992 ◽  
Vol 278 ◽  
Author(s):  
Steven R. Lustig ◽  
J.J. Cristy ◽  
D.A. Pensak

AbstractThe fast multipole method (FMM) is implemented in canonical ensemble particle simulations to compute non-bonded interactions efficiently with explicit error control. Multipole and local expansions have been derived to implement the FMM efficiently in Cartesian coordinates for soft-sphere (inverse power law), Lennard- Jones, Morse and Yukawa potential functions. Significant reductions in execution times have been achieved with respect to the direct method. For a given number, N, of particles the execution times of the direct method scale asO(N2). The FMM execution times scale asO(N) on sequential workstations and vector processors and asymptotically0(logN) on massively parallel computers. Connection Machine CM-2 and WAVETRACER-DTC parallel FMM implementations execute faster than the Cray-YMP vectorized FMM for ensemble sizes larger than 28k and 35k, respectively. For 256k particle ensembles the CM-2 parallel FMM is 12 times faster than the Cray-YMP vectorized direct method and 2.2 times faster than the vectorized FMM. For 256k particle ensembles the WAVETRACER-DTC parallel FMM is 33 times faster than the Cray-YMP vectorized direct method.


Sign in / Sign up

Export Citation Format

Share Document