Two-Level Scheduling Technology For Heterogeneous Clusters Using Analytical Hierarchy Processes

Author(s):  
Tianhai Zhao ◽  
Jianhua Gu ◽  
Xiaoyuan Zhang
Algorithms ◽  
2021 ◽  
Vol 14 (7) ◽  
pp. 204
Author(s):  
Wenpeng Ma ◽  
Wu Yuan ◽  
Xiazhen Liu

Incomplete Sparse Approximate Inverses (ISAI) has shown some advantages over sparse triangular solves on GPUs when it is used for the incomplete LU based preconditioner. In this paper, we extend the single GPU method for Block–ISAI to multiple GPUs algorithm by coupling Block–Jacobi preconditioner, and introduce the detailed implementation in the open source numerical package PETSc. In the experiments, two representative cases are performed and a comparative study of Block–ISAI on up to four GPUs are conducted on two major generations of NVIDIA’s GPUs (Tesla K20 and Tesla V100). Block–Jacobi preconditioning with Block–ISAI (BJPB-ISAI) shows an advantage over the level-scheduling based triangular solves from the cuSPARSE library for the cases, and the overhead of setting up Block–ISAI and the total wall clock times of GMRES is greatly reduced using Tesla V100 GPUs compared to Tesla K20 GPUs.


2017 ◽  
Vol 65 (7) ◽  
pp. 3782-3787 ◽  
Author(s):  
Yan Chen ◽  
Sheng Zuo ◽  
Yu Zhang ◽  
Xunwang Zhao ◽  
Huanhuan Zhang

2015 ◽  
Vol 43 (3) ◽  
pp. 43-43 ◽  
Author(s):  
Anshul Gandhi ◽  
Naman Mittal ◽  
Xi Zhang

Sign in / Sign up

Export Citation Format

Share Document