Parallel implementation of the Lanczos method for sparse matrices

The eigenvalue/eigenvector and linear-solve problems that arise in computational quantum dynamics applications (e.g., rovibrational spectroscopy and reaction cross-sections) often involve large sparse matrices with a particular block structure. In such cases, specialized iterative methods that employ optimal separable basis (OSB) preconditioners (derived from a block Jacobi diagonalization procedure) have been found to be very efficient at reducing the required CPU effort on serial computing platforms. Recently [1,2], a parallel implementation was introduced, based on a nonstandard domain decomposition scheme. Near-perfect parallel scalability was observed for the OSB preconditioner construction routines up to hundreds of nodes; however, the fundamental matrix–vector product operation itself was found not to scale well in general. In addition, the number of nodes was chosen selectively so as to ensure perfect load balancing. In this paper, two essential improvements are discussed: (1) a new algorithm for the matrix–vector product operation with greatly improved parallel scalability and (2) a generalization to arbitrary numbers of nodes and basis sizes. These improvements make the resulting parallel quantum dynamics codes suitable for robust application to a wide range of real molecular problems on massively parallel computing architectures.
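
To illustrate why the matrix–vector product is the operation whose scalability matters, here is a minimal serial sketch of the plain Lanczos iteration named in the title. It is not the authors' parallel OSB-preconditioned code; the function name `lanczos`, the callable `matvec`, and the diagonal test matrix are assumptions chosen for illustration. Each step calls `matvec` exactly once, and everything else is cheap vector arithmetic.

```python
import math

def lanczos(matvec, v0, m):
    """Run up to m Lanczos steps for a symmetric operator.

    Returns (alpha, beta): the diagonal and off-diagonal entries of the
    tridiagonal matrix T whose eigenvalues approximate those of the operator.
    No reorthogonalization is performed (illustrative sketch only).
    """
    norm = math.sqrt(sum(x * x for x in v0))
    v = [x / norm for x in v0]
    v_prev = [0.0] * len(v)
    alpha, beta = [], []
    b = 0.0
    for j in range(m):
        w = matvec(v)  # the single matrix-vector product of this step
        a = sum(wi * vi for wi, vi in zip(w, v))
        # Three-term recurrence: remove components along v and v_prev.
        w = [wi - a * vi - b * pi for wi, vi, pi in zip(w, v, v_prev)]
        alpha.append(a)
        b = math.sqrt(sum(x * x for x in w))
        if j == m - 1 or b < 1e-12:  # finished, or invariant subspace found
            break
        beta.append(b)
        v_prev, v = v, [x / b for x in w]
    return alpha, beta

# Hypothetical 3x3 diagonal operator with eigenvalues 1, 2, 3.
diag = [1.0, 2.0, 3.0]
alpha, beta = lanczos(lambda v: [d * x for d, x in zip(diag, v)], [1.0, 1.0, 1.0], 3)
```

In a distributed setting the dot products reduce to global sums, so the communication pattern of `matvec` is typically what determines how well the whole iteration scales.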

A Parallel implementation of the general Lanczos method on the CRAY T3D

Vector and Parallel Processing — VECPAR'96 - Lecture Notes in Computer Science, 1997, pp. 168-182. DOI: 10.1007/3-540-62828-2_119

Author(s): José Ignacio Aliaga, Vicente Hernández, José Luis Pérez

Keyword(s): Parallel Implementation, Lanczos Method

A Parallel Implementation of the Eigenproblem for Large, Symmetric and Sparse Matrices

Recent Advances in Parallel Virtual Machine and Message Passing Interface - Lecture Notes in Computer Science, 1999, pp. 380-387. DOI: 10.1007/3-540-48158-3_47. Cited By ~ 3

Author(s): E. M. Garzón, I. García

Keyword(s): Parallel Implementation, Sparse Matrices

HLanc: Heterogeneous Parallel Implementation of the Implicitly Restarted Lanczos Method

2014 43rd International Conference on Parallel Processing Workshops, 2014. DOI: 10.1109/icppw.2014.60. Cited By ~ 2

Author(s): Shuai Zhang, Tao Li, Xiaofan Jiao, Yifeng Wang, Yulu Yang

Keyword(s): Parallel Implementation, Lanczos Method

A 3D Finite-Difference BiCG Iterative Solver with the Fourier-Jacobi Preconditioner for the Anisotropic EIT/EEG Forward Problem

Computational and Mathematical Methods in Medicine, 2014, Vol 2014, pp. 1-12. DOI: 10.1155/2014/426902. Cited By ~ 13

Author(s): Sergei Turovets, Vasily Volkov, Aleksej Zherdetsky, Alena Prakonina, Allen D. Malony

Keyword(s): Finite Difference, High Efficiency, Parallel Implementation, Sparse Matrices, Numerical Technique, Three Dimensional, Human Head, Preconditioned Conjugate Gradient, Impedance Tomography, Mixed Derivatives

The Electrical Impedance Tomography (EIT) and electroencephalography (EEG) forward problems in anisotropic inhomogeneous media like the human head belong to the class of three-dimensional boundary value problems for elliptic equations with mixed derivatives. We introduce and explore the performance of several promising new numerical techniques, which seem to be more suitable for solving these problems. The proposed numerical schemes combine the fictitious domain approach with the finite-difference method and an optimally preconditioned Conjugate Gradient (CG) type iterative method for treating the discrete model. The numerical scheme uses only standard operations (summation and multiplication of sparse matrices and vectors, as well as the FFT), which makes it easy to implement and well suited to efficient parallel implementation. Some typical use cases for the EIT/EEG problems are considered, demonstrating the high efficiency of the proposed numerical technique.
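
As a rough illustration of a preconditioned CG-type iteration, the sketch below solves a small symmetric positive definite system with a plain diagonal (Jacobi) preconditioner. This is a stand-in, not the paper's method: the Fourier-Jacobi preconditioner, the BiCG variant named in the title, and the finite-difference discretization are not reproduced, and the names `pcg`, `matvec`, and `precond` are assumptions made for this example.

```python
def dot(u, v):
    """Euclidean inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))

def pcg(matvec, b, precond, tol=1e-10, max_iter=100):
    """Preconditioned conjugate gradient for a symmetric positive definite system.

    matvec(v) applies the matrix; precond(r) applies an approximate inverse.
    """
    x = [0.0] * len(b)
    r = list(b)               # residual b - A x for the zero initial guess
    z = precond(r)
    p = list(z)
    rz = dot(r, z)
    for _ in range(max_iter):
        if dot(r, r) ** 0.5 < tol:
            break
        Ap = matvec(p)        # one matrix-vector product per iteration
        step = rz / dot(p, Ap)
        x = [xi + step * pi for xi, pi in zip(x, p)]
        r = [ri - step * ai for ri, ai in zip(r, Ap)]
        z = precond(r)
        rz_new = dot(r, z)
        p = [zi + (rz_new / rz) * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x

# Hypothetical 2x2 SPD system; the exact solution is [1/11, 7/11].
A = [[4.0, 1.0], [1.0, 3.0]]
rhs = [1.0, 2.0]
apply_A = lambda v: [dot(row, v) for row in A]
jacobi = lambda r: [ri / A[i][i] for i, ri in enumerate(r)]  # divide by the diagonal
x = pcg(apply_A, rhs, jacobi)
```

The loop touches the matrix only through `matvec` and `precond`, which is why a scheme built from sparse products and FFTs maps naturally onto parallel hardware.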

Sparse matrix-vector multiplication on network-on-chip

Advances in Radio Science, 2010, Vol 8, pp. 289-294. DOI: 10.5194/ars-8-289-2010. Cited By ~ 6

Author(s): C.-C. Sun, J. Götze, H.-Y. Jheng, S.-J. Ruan

Keyword(s): Parallel Implementation, Sparse Matrix, Sparse Matrices, Network On Chip, Main Step, Local Data, Matrix Vector Multiplication, Data Transfers, On Chip, Matrix Vector

In this paper, we present an idea for performing matrix-vector multiplication using a Network-on-Chip (NoC) architecture. In traditional IC design, on-chip communications have been designed with dedicated point-to-point interconnections. Regular local data transfer is therefore the central concept of many parallel implementations. However, in a parallel implementation of sparse matrix-vector multiplication (SMVM), which is the main step of all iterative algorithms for solving systems of linear equations, the required data transfers depend on the sparsity structure of the matrix and can be extremely irregular. The NoC architecture makes it possible to handle an arbitrary structure of data transfers, i.e., the irregular structure of sparse matrices. So far, we have implemented the proposed SMVM-NoC architecture in sizes 4×4 and 5×5 on an FPGA, using IEEE 754 single-precision floating point.
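
The irregular transfers described here can be seen in a minimal serial sketch of sparse matrix-vector multiplication over the compressed sparse row (CSR) layout. CSR is a common storage format assumed for this example; the abstract does not say which format the SMVM-NoC design uses, and the function name `csr_matvec` and the 3×3 matrix are hypothetical.

```python
def csr_matvec(indptr, indices, data, x):
    """Compute y = A x for a matrix A stored in CSR form.

    indptr[row]..indptr[row + 1] delimits the nonzeros of each row,
    indices holds their column numbers, and data holds their values.
    """
    y = [0.0] * (len(indptr) - 1)
    for row in range(len(y)):
        acc = 0.0
        for k in range(indptr[row], indptr[row + 1]):
            # The column index decides which entry of x is read, so the
            # access pattern follows the sparsity structure of the matrix.
            acc += data[k] * x[indices[k]]
        y[row] = acc
    return y

# Hypothetical matrix [[2, 0, 1], [0, 3, 0], [4, 0, 5]] in CSR form.
indptr = [0, 2, 3, 5]
indices = [0, 2, 1, 0, 2]
data = [2.0, 1.0, 3.0, 4.0, 5.0]
y = csr_matvec(indptr, indices, data, [1.0, 2.0, 3.0])
```

When the rows are spread across processing elements, each read of `x[indices[k]]` may require data held elsewhere, which is the irregular communication a routed network is meant to absorb.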
