Effects of mesh loop modes on performance of unstructured finite volume GPU simulations

AbstractIn unstructured finite volume method, loop on different mesh components such as cells, faces, nodes, etc is used widely for the traversal of data. Mesh loop results in direct or indirect data access that affects data locality significantly. By loop on mesh, many threads accessing the same data lead to data dependence. Both data locality and data dependence play an important part in the performance of GPU simulations. For optimizing a GPU-accelerated unstructured finite volume Computational Fluid Dynamics (CFD) program, the performance of hot spots under different loops on cells, faces, and nodes is evaluated on Nvidia Tesla V100 and K80. Numerical tests under different mesh scales show that the effects of mesh loop modes are different on data locality and data dependence. Specifically, face loop makes the best data locality, so long as access to face data exists in kernels. Cell loop brings the smallest overheads due to non-coalescing data access, when both cell and node data are used in computing without face data. Cell loop owns the best performance in the condition that only indirect access of cell data exists in kernels. Atomic operations reduced the performance of kernels largely in K80, which is not obvious on V100. With the suitable mesh loop mode in all kernels, the overall performance of GPU simulations can be increased by 15%-20%. Finally, the program on a single GPU V100 can achieve maximum 21.7 and average 14.1 speed up compared with 28 MPI tasks on two Intel CPUs Xeon Gold 6132.

Download Full-text

Effects of Mesh Loop Modes on Performance of Unstructured Finite Volume GPU Simulations

10.21203/rs.3.rs-408298/v1 ◽

2021 ◽

Author(s):

Yue Weng ◽

Xi Zhang ◽

Xiaohu Guo ◽

Xianwei Zhang ◽

Yutong Lu ◽

...

Keyword(s):

Finite Volume ◽

Data Access ◽

Data Locality ◽

Data Dependence ◽

Speed Up ◽

Computational Fluid Dynamics Cfd ◽

Overall Performance ◽

Volume Method ◽

Cell Data ◽

Unstructured Finite Volume Method

Abstract In unstructured finite volume method, loop on different mesh components such as cells, faces, nodes, etc is used widely for the traversal of data. Mesh loop results in direct or indirect data access that affects data locality significantly. By loop on mesh, many threads accessing the same data lead to data dependence. Both data locality and data dependence play an important part in the performance of GPU simulations. For optimizing a GPU-accelerated unstructured finite volume Computational Fluid Dynamics (CFD) program, the performance of hot spots under different loops on cells, faces, and nodes is evaluated on Nvidia Tesla V100 and K80. Numerical tests under different mesh scales show that the effects of mesh loop modes are different on data locality and data dependence. Specifically, face loop makes the best data locality, so long as access to face data exists in kernels. Cell loop brings the smallest overheads due to non-coalescing data access, when both cell and node data are used in computing without face data. Cell loop owns the best performance in the condition that only indirect access of cell data exists in kernels. Atomic operations reduced the performance of kernels largely in K80, which is not obvious on V100. With the suitable mesh loop mode in all kernels, the overall performance of GPU simulations can be increased by 15%-20%. Finally, the program on a single GPU V100 can achieve 4.8 speed up comparing with 28 MPI tasks on two Intel CPUs Xeon Gold 6132.

Download Full-text

An unstructured finite volume method for viscoelastic flow simulations with highly truncated domains

Journal of Non-Newtonian Fluid Mechanics ◽

10.1016/j.jnnfm.2016.01.007 ◽

2016 ◽

Vol 233 ◽

pp. 48-60 ◽

Cited By ~ 1

Author(s):

Shicheng Xue ◽

Geoffrey W. Barton

Keyword(s):

Finite Volume Method ◽

Finite Volume ◽

Viscoelastic Flow ◽

Flow Simulations ◽

Volume Method ◽

Unstructured Finite Volume Method

Download Full-text

A Stable Unstructured Finite Volume Method with Multigrid for Parallel Large-Scale Incompressible Viscous Fluid Flow Computations

49th AIAA Aerospace Sciences Meeting including the New Horizons Forum and Aerospace Exposition ◽

10.2514/6.2011-778 ◽

2011 ◽

Author(s):

Mehmet Sahin

Keyword(s):

Fluid Flow ◽

Viscous Fluid ◽

Finite Volume Method ◽

Finite Volume ◽

Large Scale ◽

Incompressible Viscous Fluid ◽

Viscous Fluid Flow ◽

Volume Method ◽

Unstructured Finite Volume Method

Download Full-text

Computation of Electrical Conditions Inside a Wire-Plate Electrostatic Precipitator Using an Unstructured Finite Volume Method

2008 IEEE Industry Applications Society Annual Meeting ◽

10.1109/08ias.2008.85 ◽

2008 ◽

Author(s):

Zhengwei Long ◽

Qiang Yao ◽

Qiang Song ◽

Shuiqing Li ◽

A. Tilmatine

Keyword(s):

Finite Volume Method ◽

Finite Volume ◽

Electrostatic Precipitator ◽

Volume Method ◽

Unstructured Finite Volume Method

Download Full-text

Application of a Riemann Solver Unstructured Finite Volume Method to Combustion Instabilities

Journal of Propulsion and Power ◽

10.2514/1.b35539 ◽

2015 ◽

Vol 31 (3) ◽

pp. 937-950 ◽

Cited By ~ 2

Author(s):

Perry L. Johnson ◽

Jared M. Pent ◽

Hrvoje Jasak ◽

J. Enrique Portillo

Keyword(s):

Finite Volume Method ◽

Finite Volume ◽

Riemann Solver ◽

Combustion Instabilities ◽

Volume Method ◽

Unstructured Finite Volume Method

Download Full-text

A parallel adaptive unstructured finite volume method for linear stability (normal mode) analysis of viscoelastic fluid flows

Journal of Non-Newtonian Fluid Mechanics ◽

10.1016/j.jnnfm.2008.01.004 ◽

2008 ◽

Vol 155 (1-2) ◽

pp. 1-14 ◽

Cited By ~ 7

Author(s):

Mehmet Sahin ◽

Helen J. Wilson

Keyword(s):

Finite Volume Method ◽

Linear Stability ◽

Normal Mode ◽

Finite Volume ◽

Viscoelastic Fluid ◽

Normal Mode Analysis ◽

Fluid Flows ◽

Mode Analysis ◽

Volume Method ◽

Unstructured Finite Volume Method

Download Full-text

Navier-Stokes Simulation of the MIT Flapping Foil Experiment Using an Unstructured Finite Volume Method

Volume 1: Aircraft Engine; Marine; Turbomachinery; Microturbines and Small Turbomachinery ◽

10.1115/99-gt-214 ◽

1999 ◽

Cited By ~ 4

Author(s):

Dong Jin Kang ◽

Sang Soo Bae ◽

Jae Won Kim

Keyword(s):

Boundary Layer ◽

Finite Volume Method ◽

Finite Volume ◽

Local Minimum ◽

Free Stream ◽

Navier Stokes ◽

Unsteady Boundary Layer ◽

Flapping Foil ◽

Volume Method ◽

Unstructured Finite Volume Method

A Navier-Stokes simulation of the MIT flapping foil experiment is presented. The MIT experiment was designed to provide a good quality database for unsteady boundary layer flows. The unsteady boundary layer around a hydrofoil was generated by flapping two airfoils upstream of the hydrofoil. Present Navier-Stokes simulation is carried out on the entire experimental domain, including the flapping airfoils as well as the downstream fixed hydrofoil. Present Navier-Stokes code uses an unstructured finite volume method based on the SIMPLE algorithm. It uses QUICK scheme for the convective terms and the second order Euler backward differencing for time derivatives to keep second order accuracy spatially and temporally. All other spatial derivatives are approximated by using central difference scheme. All comparisons of present time averaged and unsteady solutions with the corresponding experimental data are satisfactory: all unsteady solutions are compared in terms of time mean and first harmonic. The first harmonic of the velocity shows a peak inside the boundary layer along the surfaces of the hydrofoil and has a local minimum near the edge of the boundary layer. The local minimum becomes manifest as the boundary layer grows. The unsteadiness in the free stream is transferred inside the boundary layer when an unsteady vortex impinges on the surface. The entrained unsteadiness travels with a local velocity slower than that in the free stream. This causes phase lag of the first harmonic between the free stream and the boundary layer and local minimum of the first harmonic near the edge of the boundary layer.

Download Full-text

An Unstructured Finite-Volume Method for Structure–Electrostatics Interactions in MEMS

Numerical Heat Transfer Part B Fundamentals ◽

10.1080/10407790.2011.628252 ◽

2011 ◽

Vol 60 (6) ◽

pp. 425-451 ◽

Cited By ~ 13

Author(s):

Shankhadeep Das ◽

Sanjay R. Mathur ◽

Jayathi Y. Murthy

Keyword(s):

Finite Volume Method ◽

Finite Volume ◽

Volume Method ◽

Unstructured Finite Volume Method

Download Full-text

Multi-Mode Heat Transfer in a Randomly Packed Bed of Cylindrical Rods Using a Finite Volume Scheme

10.1115/imece2000-1573 ◽

2000 ◽

Author(s):

J. Y. Murthy ◽

S. R. Mathur

Keyword(s):

Heat Transfer ◽

Finite Volume ◽

Packed Bed ◽

Refractive Indices ◽

Finite Volume Scheme ◽

Volume Scheme ◽

Volume Method ◽

Periodic Module ◽

Multi Mode ◽

Unstructured Finite Volume Method

Abstract In this paper, calculations of mixed-mode heat transfer in beds of randomly-packed cylinders are presented. An unstructured finite volume method is employed. Random packing is addressed by meshing a periodic module, and creating the bed by stacking and random lateral translation of modules. The ability of the finite volume scheme to employ arbitrary polyhedra is exploited in addressing the resulting non-conformal interfaces. Conduction and radiation are considered, but convection is ignored. Results are presented for conducting and semi-transparent cylinders for a range of fluid and solid conductivities and solid refractive indices and establish the viability and versatility of the method.

Download Full-text