An efficient sparse matrix-vector multiplication on CUDA-enabled graphic processing units for finite element method simulations

2016 ◽ Vol 110 (1) ◽ pp. 57-78 ◽ Author(s): Atakan Altinkaynak
2008 ◽ Vol 178 (8) ◽ pp. 558-570 ◽ Author(s): Yousef Elkurdi, David Fernández, Evgueni Souleimanov, Dennis Giannacopoulos, Warren J. Gross

2010 ◽ Vol 46 (8) ◽ pp. 2982-2985 ◽ Author(s): Maryam Mehri Dehnavi, David M. Fernandez, Dennis Giannacopoulos

Author(s): Vikalp Mishra, Krishnan Suresh

A serious computational bottleneck in finite element analysis today is the solution of the underlying system of equations. To alleviate this problem, researchers have proposed the use of graphics processing units (GPUs) for fast iterative solution of such equations. Indeed, researchers have shown that a GPU implementation of double-precision sparse matrix-vector multiplication (which underlies all iterative methods) is approximately an order of magnitude faster than an optimized CPU implementation. Unfortunately, fast matrix-vector multiplication alone is insufficient; a good preconditioner is necessary for rapid convergence. Furthermore, most modern preconditioners, such as incomplete Cholesky, are expensive to compute and cannot be easily ported to the GPU. In this paper, we propose a special class of preconditioners for the analysis of thin structures, such as beams and plates. The proposed preconditioners are developed by combining the multigrid method with a recently developed dual-representation method for thin structures. It is shown that these preconditioners are computationally inexpensive, perform better than standard preconditioners, and can be easily ported to the GPU.
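To make the role of the preconditioner and the matrix-vector product in the iteration concrete, here is a minimal sketch of preconditioned conjugate gradient. This is not the paper's method: its preconditioner combines multigrid with a dual representation of thin structures, whereas a simple diagonal (Jacobi) preconditioner stands in below purely to show where the preconditioner solve and the SpMV enter each iteration.

```python
# Hedged sketch: Jacobi-preconditioned conjugate gradient (PCG).
# The paper's multigrid/dual-representation preconditioner is replaced
# here by a diagonal (Jacobi) preconditioner for illustration only.

def pcg(A, b, tol=1e-10, max_iter=100):
    """Solve A x = b for symmetric positive definite A (dense list-of-lists)."""
    n = len(b)
    mv = lambda M, v: [sum(M[i][j] * v[j] for j in range(n)) for i in range(n)]
    inv_diag = [1.0 / A[i][i] for i in range(n)]   # Jacobi preconditioner M^-1
    x = [0.0] * n
    r = b[:]                                       # residual b - A x0 (x0 = 0)
    z = [inv_diag[i] * r[i] for i in range(n)]     # apply preconditioner
    p = z[:]
    rz = sum(ri * zi for ri, zi in zip(r, z))
    for _ in range(max_iter):
        Ap = mv(A, p)                              # the (Sp)MV bottleneck per iteration
        alpha = rz / sum(pi * api for pi, api in zip(p, Ap))
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        if sum(ri * ri for ri in r) ** 0.5 < tol:
            break
        z = [inv_diag[i] * r[i] for i in range(n)]
        rz_new = sum(ri * zi for ri, zi in zip(r, z))
        p = [zi + (rz_new / rz) * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x

A = [[4.0, 1.0], [1.0, 3.0]]
b = [1.0, 2.0]
x = pcg(A, b)    # exact solution is [1/11, 7/11]
```

A better preconditioner reduces the number of iterations, and hence the number of matrix-vector products, which is why a cheap, GPU-friendly preconditioner matters even when the SpMV itself is already fast.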


2016 ◽ Vol 2016 ◽ pp. 1-12 ◽ Author(s): Guixia He, Jiaquan Gao

Sparse matrix-vector multiplication (SpMV) is an important operation in scientific computing. Compressed sparse row (CSR) is the most frequently used format for storing sparse matrices. However, CSR-based SpMV kernels on graphics processing units (GPUs), such as CSR-scalar and CSR-vector, usually perform poorly due to irregular memory access patterns. This motivates us to propose a perfect CSR-based SpMV on the GPU, called PCSR. PCSR involves two kernels and accesses the CSR arrays in a fully coalesced manner by introducing a middle array, which greatly alleviates the deficiencies of CSR-scalar (rare coalescing) and CSR-vector (partial coalescing). Test results on a single C2050 GPU show that PCSR outperforms CSR-scalar, CSR-vector, and the CSRMV and HYBMV routines in the vendor-tuned CUSPARSE library, and is comparable with a recently proposed CSR-based algorithm, CSR-Adaptive. Furthermore, we extend PCSR from a single GPU to multiple GPUs. Experimental results on four C2050 GPUs show that, whether or not inter-GPU communication is taken into account, PCSR on multiple GPUs achieves good performance and high parallel efficiency.
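For readers unfamiliar with the CSR layout the abstract refers to, here is a minimal sketch of CSR storage and a row-wise SpMV. This is not the paper's PCSR kernel; it mirrors the access pattern of the basic CSR-scalar GPU kernel, where each thread processes one row, which is what produces the uncoalesced memory accesses PCSR is designed to avoid.

```python
# Illustrative sketch (not PCSR): CSR storage and a row-wise SpMV.
# In the GPU CSR-scalar kernel, the outer loop body runs as one thread
# per row; adjacent threads then read val/col_idx at strided offsets,
# which yields the poorly coalesced loads described in the abstract.

def csr_spmv(row_ptr, col_idx, val, x):
    """Compute y = A @ x for a matrix A stored in CSR form."""
    n = len(row_ptr) - 1
    y = [0.0] * n
    for i in range(n):                        # one "thread" per row
        acc = 0.0
        for k in range(row_ptr[i], row_ptr[i + 1]):
            acc += val[k] * x[col_idx[k]]     # gather: irregular reads of x
        y[i] = acc
    return y

# 3x3 sparse matrix:
# [[4, 0, 1],
#  [0, 3, 0],
#  [2, 0, 5]]
row_ptr = [0, 2, 3, 5]    # row i occupies val[row_ptr[i]:row_ptr[i+1]]
col_idx = [0, 2, 1, 0, 2]
val = [4.0, 1.0, 3.0, 2.0, 5.0]
y = csr_spmv(row_ptr, col_idx, val, [1.0, 1.0, 1.0])   # [5.0, 3.0, 7.0]
```

CSR-vector assigns a warp (rather than a single thread) to each row so that the inner loop over nonzeros is partially coalesced; PCSR's middle array, per the abstract, restructures the accesses so the CSR arrays are read in a fully coalesced manner.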

