iterative solvers Latest Research Papers

Assessing the robustness and scalability of the accelerated pseudo-transient method towards exascale computing

10.5194/gmd-2021-411 ◽

2022 ◽

Author(s):

Ludovic Räss ◽

Ivan Utkin ◽

Thibault Duretz ◽

Samuel Omlin ◽

Yuri Y. Podladchikov

Keyword(s):

High Performance ◽

Degrees Of Freedom ◽

Variable Viscosity ◽

Shear Banding ◽

Numerical Algorithms ◽

Nonlinear Problems ◽

Iterative Solvers ◽

Arbitrary Distribution ◽

The Road ◽

Flow Configurations

Abstract. The development of highly efficient, robust and scalable numerical algorithms lags behind the rapid increase in massive parallelism of modern hardware. We address this challenge with the accelerated pseudo-transient iterative method and present here a physically motivated derivation. We analytically determine optimal iteration parameters for a variety of basic physical processes and confirm the validity of theoretical predictions with numerical experiments. We provide an efficient numerical implementation of pseudo-transient solvers on graphical processing units (GPUs) using the Julia language. We achieve a parallel efficiency over 96 % on 2197 GPUs in distributed memory parallelisation weak scaling benchmarks. 2197 GPUs allow for unprecedented terascale solutions of 3D variable viscosity Stokes flow on 49953 grid cells involving over 1.2 trillion degrees of freedom. We verify the robustness of the method by handling contrasts up to 9 orders of magnitude in material parameters such as viscosity, and arbitrary distribution of viscous inclusions for different flow configurations. Moreover, we show that this method is well suited to tackle strongly nonlinear problems such as shear-banding in a visco-elasto-plastic medium. A GPU-based implementation can outperform CPU-based direct-iterative solvers in terms of wall-time even at relatively low resolution. We additionally motivate the accessibility of the method by its conciseness, flexibility, physically motivated derivation and ease of implementation. This solution strategy has thus a great potential for future high-performance computing applications, and for paving the road to exascale in the geosciences and beyond.

Download Full-text

Communication-Computation Overlapping for Preconditioned Parallel Iterative Solvers with Dynamic Loop Scheduling

10.1145/3503470.3503477 ◽

2022 ◽

Author(s):

Kengo Nakajima ◽

Balazs Gerofi ◽

Masashi Horikoshi ◽

Yutaka Ishikawa

Keyword(s):

Iterative Solvers ◽

Loop Scheduling ◽

Parallel Iterative

Download Full-text

Exploring Communication-Computation Overlap in Parallel Iterative Solvers on Manycore CPUs using Asynchronous Progress Control

10.1145/3503470.3503474 ◽

2022 ◽

Author(s):

Masashi Horikoshi ◽

Balazs Gerofi ◽

Yutaka Ishikawa ◽

Kengo Nakajima

Keyword(s):

Iterative Solvers ◽

Progress Control ◽

Parallel Iterative

Download Full-text

Low/Adaptive Precision Computation in Preconditioned Iterative Solvers for Ill-Conditioned Problems

10.1145/3492805.3492813 ◽

2022 ◽

Author(s):

Masatoshi Kawai ◽

Kengo Nakajima

Keyword(s):

Iterative Solvers ◽

Preconditioned Iterative

Download Full-text

Batched Sparse Iterative Solvers for Computational Chemistry Simulations on GPUs

10.1109/scala54577.2021.00010 ◽

2021 ◽

Author(s):

Isha Aggarwal ◽

Aditya Kashi ◽

Pratik Nayak ◽

Cody J. Balos ◽

Carol S. Woodward ◽

...

Keyword(s):

Computational Chemistry ◽

Iterative Solvers

Download Full-text

Achieving proportional fairness in WiFi networks via bandit convex optimization

annals of telecommunications - annales des télécommunications ◽

10.1007/s12243-021-00887-3 ◽

2021 ◽

Author(s):

Golshan Famitafreshi ◽

Cristina Cano

Keyword(s):

Channel Allocation ◽

Transmission Probability ◽

Iterative Solvers ◽

Proportional Fairness ◽

Estimation Errors ◽

Wifi Networks ◽

Network Parameters ◽

True Value ◽

802.11 Networks ◽

Traditional Approaches

AbstractIn this paper, we revisit proportional fair channel allocation in IEEE 802.11 networks. Traditional approaches are either based on the explicit solution of the optimization problem or use iterative solvers to converge to the optimum. Instead, we propose an algorithm able to learn the optimal slot transmission probability only by monitoring the throughput of the network. We have evaluated this algorithm both (i) using the true value of the function to optimize and (ii) considering estimation errors. We provide a comprehensive performance evaluation that includes assessing the sensitivity of the algorithm to different learning and network parameters as well as its reaction to network dynamics. We also evaluate the effect of noisy estimates on the convergence rate and propose a method to alleviate them. We believe our approach is a practical solution to improve the performance of wireless networks as it does not require knowing the network parameters in advance. Yet, we conclude that the setting of the parameters of the algorithm is crucial to guarantee fast convergence.

Download Full-text

A GPU-Accelerated Linear Solver for Massively Parallel Underground Simulations

10.2118/204011-ms ◽

2021 ◽

Author(s):

Giovanni Isotton ◽

Carlo Janna ◽

Nicoló Spiezia ◽

Omar Tosatto ◽

Massimo Bernaschi ◽

...

Keyword(s):

Large Scale ◽

Iterative Solvers ◽

Sparse Approximate Inverse ◽

Modern Engineering ◽

Numerical Experimentation ◽

Approximate Inverses ◽

Preconditioned Iterative ◽

Set Up ◽

High Level ◽

Large Scale Simulations

Abstract Modern engineering applications require the solution of linear systems of millions or even billions of equations. The solution of the linear system takes most of the simulation for large scale simulations, and represent the bottleneck in developing scientific and technical software. Usually, preconditioned iterative solvers are preferred because of their low memory requirements and they can have a high level of parallelism. Approximate inverses have been proven to be robust and effective preconditioners in several contexts. In this communication, we present an adaptive Factorized Sparse Approximate Inverse (FSAI) preconditioner with a very high level of parallelism in both set-up and application. Its inherent parallelism makes FSAI an ideal candidate for a GPU-accelerated implementation, even if taking advantage of this hardware is not a trivial task, especially in the set-up stage. An extensive numerical experimentation has been performed on industrial underground applications. It is shown that the proposed approach outperforms more traditional preconditioners in challenging underground simulation, greatly reducing time-to-solution.

Download Full-text

All-at-once multigrid approaches for one-dimensional space-fractional diffusion equations

CALCOLO ◽

10.1007/s10092-021-00436-3 ◽

2021 ◽

Vol 58 (4) ◽

Author(s):

Marco Donatelli ◽

Rolf Krause ◽

Mariarosa Mazza ◽

Ken Trotti

Keyword(s):

Diffusion Coefficients ◽

Dimensional Space ◽

Uniform Space ◽

Computational Cost ◽

Fractional Diffusion ◽

Variable Coefficients ◽

Iterative Solvers ◽

One Dimensional ◽

Good Convergence ◽

Constant Diffusion

AbstractWe focus on a time-dependent one-dimensional space-fractional diffusion equation with constant diffusion coefficients. An all-at-once rephrasing of the discretized problem, obtained by considering the time as an additional dimension, yields a large block linear system and paves the way for parallelization. In particular, in case of uniform space–time meshes, the coefficient matrix shows a two-level Toeplitz structure, and such structure can be leveraged to build ad-hoc iterative solvers that aim at ensuring an overall computational cost independent of time. In this direction, we study the behavior of certain multigrid strategies with both semi- and full-coarsening that properly take into account the sources of anisotropy of the problem caused by the grid choice and the diffusion coefficients. The performances of the aforementioned multigrid methods reveal sensitive to the choice of the time discretization scheme. Many tests show that Crank–Nicolson prevents the multigrid to yield good convergence results, while second-order backward-difference scheme is shown to be unconditionally stable and that it allows good convergence under certain conditions on the grid and the diffusion coefficients. The effectiveness of our proposal is numerically confirmed in the case of variable coefficients too and a two-dimensional example is given.

Download Full-text

Solution of a Linear Nondegenerate Matrix Equation Based on the Zero Divisor

Herald of the Bauman Moscow State Technical University Series Natural Sciences ◽

10.18698/1812-3368-2021-5-49-59 ◽

2021 ◽

pp. 49-59

Author(s):

N.E. Zubov ◽

V.N. Ryabchenko

Keyword(s):

Power Systems ◽

Matrix Equation ◽

Zero Divisor ◽

Iterative Solvers ◽

Linear Matrix ◽

Matrix Equations ◽

Proved Theorem ◽

Nondegenerate Matrix ◽

The Matrix ◽

The Right

New formulas were obtained to solve the linear non-degenerate matrix equations based on zero divisors of numerical matrices. Two theorems were formulated, and a proof to one of them is provided. It is noted that the proof of the second theorem is similar to the proof of the first one. The proved theorem substantiates new formula in solving the equation equivalent in the sense of the solution uniqueness to the known formulas. Its fundamental difference lies in the following: any explicit matrix inversion or determinant calculation is missing; solution is "based" not on the left, but on the right side of the matrix equation; zero divisor method is used (it was never used in classical formulas for solving a matrix equation); zero divisor calculation is reduced to simple operations of permutating the vector elements on the right-hand side of the matrix equation. Examples are provided of applying the proposed method for solving a nondegenerate matrix equation to the numerical matrix equations. High accuracy of the proposed formulas for solving the matrix equations is demonstrated in comparison with standard solvers used in the MATLAB environment. Similar problems arise in the synthesis of fast and ultrafast iterative solvers of linear matrix equations, as well as in nonparametric identification of abnormal (emergency) modes in complex technical systems, for example, in the power systems

Download Full-text

Time-Domain Multidimensional Deconvolution: A Physically Reliable and Stable Preconditioned Implementation

Remote Sensing ◽

10.3390/rs13183683 ◽

2021 ◽

Vol 13 (18) ◽

pp. 3683

Author(s):

David Vargas ◽

Ivan Vasconcelos ◽

Matteo Ravasi ◽

Nick Luiken

Keyword(s):

Inverse Problem ◽

Time Domain ◽

Body Wave ◽

Complex Salt ◽

Iterative Solvers ◽

Physical Constraints ◽

Damping Parameter ◽

Band Limited ◽

The Time Domain ◽

Multiple Elimination

Multidimensional deconvolution constitutes an essential operation in a variety of geophysical scenarios at different scales ranging from reservoir to crustal, as it appears in applications such as surface multiple elimination, target-oriented redatuming, and interferometric body-wave retrieval just to name a few. Depending on the use case, active, microseismic, or teleseismic signals are used to reconstruct the broadband response that would have been recorded between two observation points as if one were a virtual source. Reconstructing such a response relies on the the solution of an ill-conditioned linear inverse problem sensitive to noise and artifacts due to incomplete acquisition, limited sources, and band-limited data. Typically, this inversion is performed in the Fourier domain where the inverse problem is solved per frequency via direct or iterative solvers. While this inversion is in theory meant to remove spurious events from cross-correlation gathers and to correct amplitudes, difficulties arise in the estimation of optimal regularization parameters, which are worsened by the fact they must be estimated at each frequency independently. Here we show the benefits of formulating the problem in the time domain and introduce a number of physical constraints that naturally drive the inversion towards a reduced set of stable, meaningful solutions. By exploiting reciprocity, time causality, and frequency-wavenumber locality a set of preconditioners are included at minimal additional cost as a way to alleviate the dependency on an optimal damping parameter to stabilize the inversion. With an interferometric redatuming example, we demonstrate how our time domain implementation successfully reconstructs the overburden-free reflection response beneath a complex salt body from noise-contaminated up- and down-going transmission responses at the target level.

Download Full-text

iterative solvers
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Assessing the robustness and scalability of the accelerated pseudo-transient method towards exascale computing

Communication-Computation Overlapping for Preconditioned Parallel Iterative Solvers with Dynamic Loop Scheduling

Exploring Communication-Computation Overlap in Parallel Iterative Solvers on Manycore CPUs using Asynchronous Progress Control

Low/Adaptive Precision Computation in Preconditioned Iterative Solvers for Ill-Conditioned Problems

Batched Sparse Iterative Solvers for Computational Chemistry Simulations on GPUs

Achieving proportional fairness in WiFi networks via bandit convex optimization

A GPU-Accelerated Linear Solver for Massively Parallel Underground Simulations

All-at-once multigrid approaches for one-dimensional space-fractional diffusion equations

Solution of a Linear Nondegenerate Matrix Equation Based on the Zero Divisor

Time-Domain Multidimensional Deconvolution: A Physically Reliable and Stable Preconditioned Implementation

Export Citation Format

iterative solversRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Assessing the robustness and scalability of the accelerated pseudo-transient method towards exascale computing

Communication-Computation Overlapping for Preconditioned Parallel Iterative Solvers with Dynamic Loop Scheduling

Exploring Communication-Computation Overlap in Parallel Iterative Solvers on Manycore CPUs using Asynchronous Progress Control

Low/Adaptive Precision Computation in Preconditioned Iterative Solvers for Ill-Conditioned Problems

Batched Sparse Iterative Solvers for Computational Chemistry Simulations on GPUs

Achieving proportional fairness in WiFi networks via bandit convex optimization

A GPU-Accelerated Linear Solver for Massively Parallel Underground Simulations

All-at-once multigrid approaches for one-dimensional space-fractional diffusion equations

Solution of a Linear Nondegenerate Matrix Equation Based on the Zero Divisor

Time-Domain Multidimensional Deconvolution: A Physically Reliable and Stable Preconditioned Implementation

iterative solvers
Recently Published Documents