scholarly journals Implementation of multirate time integration methods for air pollution modelling

2012 ◽  
Vol 5 (6) ◽  
pp. 1395-1405 ◽  
Author(s):  
M. Schlegel ◽  
O. Knoth ◽  
M. Arnold ◽  
R. Wolke

Abstract. Explicit time integration methods are characterised by a small numerical effort per time step. In the application to multiscale problems in atmospheric modelling, this benefit is often more than compensated by stability problems and step size restrictions resulting from stiff chemical reaction terms and from a locally varying Courant-Friedrichs-Lewy (CFL) condition for the advection terms. Splitting methods may be applied to efficiently combine implicit and explicit methods (IMEX splitting). Complementarily multirate time integration schemes allow for a local adaptation of the time step size to the grid size. In combination, these approaches lead to schemes which are efficient in terms of evaluations of the right-hand side. Special challenges arise when these methods are to be implemented. For an efficient implementation, it is crucial to locate and exploit redundancies. Furthermore, the more complex programme flow may lead to computational overhead which, in the worst case, more than compensates the theoretical gain in efficiency. We present a general splitting approach which allows both for IMEX splittings and for local time step adaptation. The main focus is on an efficient implementation of this approach for parallel computation on computer clusters.

2011 ◽  
Vol 4 (4) ◽  
pp. 2937-2972 ◽  
Author(s):  
M. Schlegel ◽  
O. Knoth ◽  
M. Arnold ◽  
R. Wolke

Abstract. Explicit time integration methods are characterized by a small numerical effort per time step. In the application to multiscale problems in atmospheric modeling, this benefit is often more than compensated by stability problems and step size restrictions resulting from stiff chemical reaction terms and from a locally varying Courant-Friedrichs-Lewy (CFL) condition for the advection terms. Splitting methods may be applied to efficiently combine implicit and explicit methods (IMEX splitting). Complementarily multirate time integration schemes allow for a local adaptation of the time step size to the grid size. In combination these approaches lead to schemes which are efficient in terms of evaluations of the right hand side. Special challenges arise when these methods are to be implemented. For an efficient implementation it is crucial to locate and exploit redundancies. Furthermore the more complex program flow may lead to computational overhead which in the worst case more than compensates the theoretical gain in efficiency. We present a general splitting approach which allows both for IMEX splittings and for local time step adaptation. The main focus is on an efficient implementation of this approach for parallel computation on computer clusters.


1973 ◽  
Vol 40 (2) ◽  
pp. 417-421 ◽  
Author(s):  
R. D. Krieg

Methods of numerical time integration of the equation M¯q¨ + K¯q = f are examined in this paper. A particular class of explicit time integration methods is defined and this class is searched for an unconditionally stable method. The class is found to contain no such method and, furthermore, is found to contain no method with a larger stable time step size than that characterized by the simple central difference time integration method.


Author(s):  
Daniel S Abdi ◽  
Francis X Giraldo ◽  
Emil M Constantinescu ◽  
Lester E Carr ◽  
Lucas C Wilcox ◽  
...  

We present the acceleration of an IMplicit–EXplicit (IMEX) nonhydrostatic atmospheric model on manycore processors such as graphic processing units (GPUs) and Intel’s Many Integrated Core (MIC) architecture. IMEX time integration methods sidestep the constraint imposed by the Courant–Friedrichs–Lewy condition on explicit methods through corrective implicit solves within each time step. In this work, we implement and evaluate the performance of IMEX on manycore processors relative to explicit methods. Using 3D-IMEX at Courant number C = 15, we obtained a speedup of about 4× relative to an explicit time stepping method run with the maximum allowable C = 1. Moreover, the unconditional stability of IMEX with respect to the fast waves means the speedup can increase significantly with the Courant number as long as the accuracy of the resulting solution is acceptable. We show a speedup of 100× at C = 150 using 1D-IMEX to demonstrate this point. Several improvements on the IMEX procedure were necessary in order to outperform our results with explicit methods: (a) reducing the number of degrees of freedom of the IMEX formulation by forming the Schur complement, (b) formulating a horizontally explicit vertically implicit 1D-IMEX scheme that has a lower workload and better scalability than 3D-IMEX, (c) using high-order polynomial preconditioners to reduce the condition number of the resulting system, and (d) using a direct solver for the 1D-IMEX method by performing and storing LU factorizations once to obtain a constant cost for any Courant number. Without all of these improvements, explicit time integration methods turned out to be difficult to beat. We discuss in detail the IMEX infrastructure required for formulating and implementing efficient methods on manycore processors. Several parametric studies are conducted to demonstrate the gain from each of the abovementioned improvements. Finally, we validate our results with standard benchmark problems in numerical weather prediction and evaluate the performance and scalability of the IMEX method using up to 4192 GPUs and 16 Knights Landing processors.


2019 ◽  
Vol 54 (2) ◽  
pp. 116-129 ◽  
Author(s):  
Roberto Ortega ◽  
Geraldine Farías ◽  
Marcela Cruchaga ◽  
Matías Rivero ◽  
Mariano Vázquez ◽  
...  

The focus of this work is on the computational modeling of a pendulum made of a hyperelastic material and the corresponding experimental validation with the aim of contributing to the study of a material commonly used in seismic absorber devices. From the proposed dynamics experiment, the motion of the pendulum is recorded using a high-speed camera. The evolution of the pendulum’s positions is recovered using a capturing motion technique by tracking markers. The simulation of the problem is developed in the framework of a parallel multi-physics code. Particular emphasis is placed on the analysis of the Newmark integration scheme and the use of Rayleigh damping model. In particular, the time step size effect is analyzed. A strong time step size dependency is obtained for dissipative time integration schemes, while the Rayleigh damping formulation without time integration dissipation shows time step–independent results when convergence is achieved.


2021 ◽  
Vol 87 (1) ◽  
Author(s):  
Hendrik Ranocha ◽  
Jan Nordström

AbstractSince integration by parts is an important tool when deriving energy or entropy estimates for differential equations, one may conjecture that some form of summation by parts (SBP) property is involved in provably stable numerical methods. This article contributes to this topic by proposing a novel class of A stable SBP time integration methods which can also be reformulated as implicit Runge-Kutta methods. In contrast to existing SBP time integration methods using simultaneous approximation terms to impose the initial condition weakly, the new schemes use a projection method to impose the initial condition strongly without destroying the SBP property. The new class of methods includes the classical Lobatto IIIA collocation method, not previously formulated as an SBP scheme. Additionally, a related SBP scheme including the classical Lobatto IIIB collocation method is developed.


Author(s):  
Olivier A. Bauchau ◽  
Alexander Epple ◽  
Carlo L. Bottasso

This paper addresses practical issues associated with the numerical enforcement of constraints in flexible multibody systems, which are characterized by index-3 differential algebraic equations (DAEs). The need to scale the equations of motion is emphasized; in the proposed approach, they are scaled based on simple physical arguments, and an augmented Lagrangian term is added to the formulation. Time discretization followed by a linearization of the resulting equations leads to a Jacobian matrix that is independent of the time step size, h; hence, the condition number of the Jacobian and error propagation are both O(h0): the numerical solution of index-3 DAEs behaves as in the case of regular ordinary differential equations (ODEs). Since the scaling factor depends on the physical properties of the system, the proposed scaling decreases the dependency of this Jacobian on physical properties, further improving the numerical conditioning of the resulting linearized equations. Because the scaling of the equations is performed before the time and space discretizations, its benefits are reaped for all time integration schemes. The augmented Lagrangian term is shown to be indispensable if the solution of the linearized system of equations is to be performed without pivoting, a requirement for the efficient solution of the sparse system of linear equations. Finally, a number of numerical examples demonstrate the efficiency of the proposed approach to scaling.


Water ◽  
2021 ◽  
Vol 13 (6) ◽  
pp. 785
Author(s):  
Arman Rokhzadi ◽  
Musandji Fuamba

This paper studies the air pressurization problem caused by a partially pressurized transient flow in a reservoir-pipe system. The purpose of this study is to analyze the performance of the rigid column model in predicting the attenuation of the air pressure distribution. In this regard, an analytic formula for the amplitude and frequency will be derived, in which the influential parameters, particularly, the driving pressure and the air and water lengths, on the damping can be seen. The direct effect of the driving pressure and inverse effect of the product of the air and water lengths on the damping will be numerically examined. In addition, these numerical observations will be examined by solving different test cases and by comparing to available experimental data to show that the rigid column model is able to predict the damping. However, due to simplified assumptions associated with the rigid column model, the energy dissipation, as well as the damping, is underestimated. In this regard, using the backward Euler implicit time integration scheme, instead of the classical fourth order explicit Runge–Kutta scheme, will be proposed so that the numerical dissipation of the backward Euler implicit scheme represents the physical dissipation. In addition, a formula will be derived to calculate the appropriate time step size, by which the dissipation of the heat transfer can be compensated.


2015 ◽  
Vol 143 (9) ◽  
pp. 3838-3855 ◽  
Author(s):  
Steven Sandbach ◽  
John Thuburn ◽  
Danail Vassilev ◽  
Michael G. Duda

Abstract An important question for atmospheric modeling is the viability of semi-implicit time integration schemes on massively parallel computing architectures. Semi-implicit schemes can provide increased stability and accuracy. However, they require the solution of an elliptic problem at each time step, creating concerns about their parallel efficiency and scalability. Here, a semi-implicit (SI) version of the Model for Prediction Across Scales (MPAS) is developed and compared with the original model version, which uses a split Runge–Kutta (SRK3) time integration scheme. The SI scheme is based on a quasi-Newton iteration toward a Crank–Nicolson scheme. Each Newton iteration requires the solution of a Helmholtz problem; here, the Helmholtz problem is derived, and its solution using a geometric multigrid method is described. On two standard test cases, a midlatitude baroclinic wave and a small-planet nonhydrostatic gravity wave, the SI and SRK3 versions produce almost identical results. On the baroclinic wave test, the SI version can use somewhat larger time steps (about 60%) than the SRK3 version before losing stability. The SI version costs 10%–20% more per step than the SRK3 version, and the weak and strong scalability characteristics of the two versions are very similar for the processor configurations the authors have been able to test (up to 1920 processors). Because of the spatial discretization of the pressure gradient in the lowest model layer, the SI version becomes unstable in the presence of realistic orography. Some further work will be needed to demonstrate the viability of the SI scheme in this case.


Sign in / Sign up

Export Citation Format

Share Document