Performance Evaluation Model for Matrix Calculation on GPU

Author(s):  
Mengjia Yin ◽  
Xianbin Xu ◽  
Tao Zhang ◽  
Conghuan Ye

Establishment of a performance evaluation model is a hotspot of current research. In this paper, the performance bottleneck is analyzed quantitatively, which provided programmers with a guidance to optimize the performance bottleneck. This paper takes a matrix as an example; the matrix is divided into a dense matrix or a sparse matrix. For dense matrix, the performance is first analyzed in a quantitative way, and an evaluation model is developed, which includes the instruction pipeline, shared memory, and global memory. For sparse matrix, this paper aims at the four formats of CSR, ELL, COO, and HYB, through the observation data obtained from the actual operation of large datasets, finds the relationship between the running time, dataset form, and storage model, and establishes their relational model functions. Through practical test and comparison, the error between the execution time of the test dataset that is predicted by the model function and the actual running time is found to be within a stable finite deviation threshold, proving that the model has certain practicability.

OPSEARCH ◽  
2019 ◽  
Vol 56 (1) ◽  
pp. 14-31 ◽  
Author(s):  
Shiva Moslemi ◽  
Hamidreza Izadbakhsh ◽  
Marzieh Zarinbal

Sign in / Sign up

Export Citation Format

Share Document