scholarly journals OP2: An active library framework for solving unstructured mesh-based applications on multi-core and many-core architectures

Author(s):  
G.R. Mudalige ◽  
M.B. Giles ◽  
I. Reguly ◽  
C. Bertolli ◽  
P.H.J Kelly
Keyword(s):  
2015 ◽  
Vol 28 (2) ◽  
pp. 557-577 ◽  
Author(s):  
I Z. Reguly ◽  
Endre László ◽  
Gihan R. Mudalige ◽  
Mike B. Giles
Keyword(s):  

2014 ◽  
Vol 77 (6) ◽  
pp. 319-333 ◽  
Author(s):  
J. Waltz ◽  
J. G. Wohlbier ◽  
L. D. Risinger ◽  
T. R. Canfield ◽  
M. R. J. Charest ◽  
...  

2020 ◽  
Vol 37 (9) ◽  
pp. 3187-3208
Author(s):  
Hongbin Liu ◽  
Hu Ren ◽  
Hanfeng Gu ◽  
Fei Gao ◽  
Guangwen Yang

Purpose The purpose of this paper is to provide an automatic parallelization toolkit for unstructured mesh-based computation. Among all kinds of mesh types, unstructured meshes are dominant in engineering simulation scenarios and play an essential role in scientific computations for their geometrical flexibility. However, the high-fidelity applications based on unstructured grids are still time-consuming, no matter for programming or running. Design/methodology/approach This study develops an efficient UNstructured Acceleration Toolkit (UNAT), which provides friendly high-level programming interfaces and elaborates lower level implementation on the target hardware to get nearly hand-optimized performance. At the present state, two efficient strategies, a multi-level blocks method and a row-subsections method, are designed and implemented on Sunway architecture. Random memory access and write–write conflict issues of unstructured meshes have been handled by partitioning, coloring and other hardware-specific techniques. Moreover, a data-reuse mechanism is developed to increase the computational intensity and alleviate the memory bandwidth bottleneck. Findings The authors select sparse matrix-vector multiplication as a performance benchmark of UNAT across different data layouts and different matrix formats. Experimental results show that the speed-ups reach up to 26× compared to single management processing element, and the utilization ratio tests indicate the capability of achieving nearly hand-optimized performance. Finally, the authors adopt UNAT to accelerate a well-tuned unstructured solver and obtain speed-ups of 19× and 10× on average for main kernels and overall solver, respectively. Originality/value The authors design an unstructured mesh toolkit, UNAT, to link the hardware and numerical algorithm, and then, engineers can focus on the algorithms and solvers rather than the parallel implementation. For the many-core processor SW26010 of the fastest supercomputer in China, UNAT yields up to 26× speed-ups and achieves nearly hand-optimized performance.


2014 ◽  
Vol E97.C (4) ◽  
pp. 360-368
Author(s):  
Takashi MIYAMORI ◽  
Hui XU ◽  
Hiroyuki USUI ◽  
Soichiro HOSODA ◽  
Toru SANO ◽  
...  
Keyword(s):  

Kerntechnik ◽  
2019 ◽  
Vol 84 (4) ◽  
pp. 262-266
Author(s):  
M. Lovecký ◽  
J. Závorka ◽  
J. Vimpel

2010 ◽  
Vol 33 (10) ◽  
pp. 1777-1787 ◽  
Author(s):  
Wei-Zhi XU ◽  
Feng-Long SONG ◽  
Zhi-Yong LIU ◽  
Dong-Rui FAN ◽  
Lei YU ◽  
...  
Keyword(s):  

2009 ◽  
Vol 31 (11) ◽  
pp. 1918-1928 ◽  
Author(s):  
Wei LIN ◽  
Xiao-Chun YE ◽  
Feng-Long SONG ◽  
Hao ZHANG
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document