nvidia cuda Latest Research Papers

OuroborosBEM: a fast multi-GPU microscopic Monte Carlo simulation for gaseous detectors and charged particle dynamics

Journal of Instrumentation ◽

10.1088/1748-0221/17/01/p01020 ◽

2022 ◽

Vol 17 (01) ◽

pp. P01020

Author(s):

G. Quéméner ◽

S. Salvador

Keyword(s):

Nuclear Physics ◽

Gaseous Medium ◽

Dielectric Materials ◽

Electrostatic Problem ◽

Charge Effects ◽

Nvidia Cuda ◽

Ion Recombination ◽

Gem Detectors ◽

Gas Medium ◽

Graphical Processor

Abstract The design of gaseous detectors for accelerator, particle and nuclear physics requires simulations relying on multi-physics aspects. In fact, these simulations deal with the dynamics of a large number of charged particles interacting in a gaseous medium immersed in the electric field generated by a more or less complex assembly of electrodes and dielectric materials. We report here on a homemade software, called ouroborosbem, able to tackle the different features involved in such simulations. After solving the electrostatic problem for which a solver based on the boundary element method (BEM) has been implemented, particles are tracked and will microscopically interact with the gas medium. Dynamical effects have been included such as the electron-ion recombination process, the charging-up of the dielectric materials and other space charge effects that might alter the detector performances. These were made possible thanks to the nVidia CUDA language specifically optimised to run on Graphical Processor Units (GPUs) to minimize the computing times. Comparisons of the results obtained for parallel plate avalanche counters and GEM detectors to literature data on swarm parameters fully validate the performances of ouroborosbem. Moreover, we were able to precisely reproduce the measured gains of single and double GEM detectors as a function of the applied voltage.

Acceleration of boundary element calculations for closed domain using nonlinear form functions and CUDA technology

Doklady BGUIR ◽

10.35596/1729-7648-2021-19-14-21 ◽

2021 ◽

Vol 19 (3) ◽

pp. 14-21

Author(s):

S. S. Sherbakov ◽

M. M. Polestchuk

Keyword(s):

Boundary Element ◽

Mutual Influence ◽

Multicore Processors ◽

Boundary Elements ◽

Mechanics Of Solids ◽

Closed Domain ◽

Is Implementation ◽

Nvidia Cuda ◽

Cuda Technology ◽

Significant Acceleration

The evolution of computer technologies, as a hardware and a software parts, allows to attain fast and accurate solutions to many applied problems in scientific areas. Acceleration of calculations is broadly used technic that is basically implemented by multithreading and multicore processors. NVidia CUDA technology or simply CUDA opens a way to efficient acceleration of boundary elements method (BEM), that includes many independent stages. The main goal of the paper is implementation and acceleration of indirect boundary element method using three form functions. Calculation of the potentialdistribution inside a closed boundary under the action of the defined boundary condition is considered. In order to accelerate corresponding calculations, they were parallelized at the graphic accelerator using NVidia CUDA technology. The dependences of acceleration of parallel computations as compared with sequential ones were explored for different numbers of boundary elements and computational nodes. A significant acceleration (up to 52 times) calculation of the potential distribution without loss in accuracy is shown. Acceleration of up to 22 times was achieved in calculation of mutual influence matrix for boundary elements. Using CUDA technology allows to attain significant acceleration without loss in accuracy and convergence. So application of CUDA is a good way to parallelizing BEM. Application of developed approach allows to solve problems in different areas of physics such as acoustics, hydromechanics, electrodynamics, mechanics of solids and many other areas, efficiently.

GPU-accelerated molecular dynamics: State-of-art software performance and porting from Nvidia CUDA to AMD HIP

The International Journal of High Performance Computing Applications ◽

10.1177/10943420211008288 ◽

2021 ◽

pp. 109434202110082

Author(s):

Nikolay Kondratyuk ◽

Vsevolod Nikolskiy ◽

Daniil Pavlov ◽

Vladimir Stegailov

Keyword(s):

Molecular Dynamics ◽

High Performance ◽

Software Performance ◽

Computing Systems ◽

Accelerated Molecular Dynamics ◽

Nvidia Cuda ◽

Software And Hardware ◽

Management Capabilities ◽

Utilization Time ◽

Performance Computing

Classical molecular dynamics (MD) calculations represent a significant part of the utilization time of high-performance computing systems. As usual, the efficiency of such calculations is based on an interplay of software and hardware that are nowadays moving to hybrid GPU-based technologies. Several well-developed open-source MD codes focused on GPUs differ both in their data management capabilities and in performance. In this work, we analyze the performance of LAMMPS, GROMACS and OpenMM MD packages with different GPU backends on Nvidia Volta and AMD Vega20 GPUs. We consider the efficiency of solving two identical MD models (generic for material science and biomolecular studies) using different software and hardware combinations. We describe our experience in porting the CUDA backend of LAMMPS to ROCm HIP that shows considerable benefits for AMD GPUs comparatively to the OpenCL backend.

RECENT CAPABILITY AND PERFORMANCE ENHANCEMENTS OF THE WHOLE-CORE TRANSPORT CODE nTRACER

EPJ Web of Conferences ◽

10.1051/epjconf/202124706033 ◽

2021 ◽

Vol 247 ◽

pp. 06033

Author(s):

Namjae Choi ◽

Hansol Park ◽

Han Gyu Lee ◽

Seungug Jae ◽

Sori Jeon ◽

...

Keyword(s):

Cross Section ◽

Treatment Methods ◽

Nvidia Cuda ◽

Transport Code ◽

And Performance ◽

Offline Processing

The whole-core transport code nTRACER has made many advances in recent years. Several innovative cross section treatment methods were developed, a new axial transport solver was introduced for stabilizing the 2D/1D scheme, and substantial computational enhancements were achieved using NVIDIA CUDA and Intel Math Kernel Library (MKL). In addition, gamma transport solver was implemented to predict the power distributions more physically, and the flexibility of the restart calculation was improved using an offline processing code nTIG (nTRACER Input Generator). This paper is the compilation of the recent progresses in nTRACER developments.

Nvidia CUDA parallel processing of large FDTD meshes in a desktop computer

Proceedings of the 10th Euro-American Conference on Telematics and Information Systems ◽

10.1145/3401895.3401934 ◽

2020 ◽

Author(s):

Robert Calatayud ◽

Enrique Navarro-Modesto ◽

Enrique A. Navarro-Camba ◽

Nagula T. Sangary

Keyword(s):

Parallel Processing ◽

Desktop Computer ◽

Nvidia Cuda

COMPARISON OF GPU AND CPU EFFICIENCY WHILE SOLVING HEAT CONDUCTION PROBLEMS

Mokslas - Lietuvos ateitis ◽

10.3846/mla.2020.13500 ◽

2020 ◽

Vol 12 (0) ◽

pp. 1-5

Author(s):

Julija Semenenko ◽

Aliaksei Kolesau ◽

Vadimas Starikovičius ◽

Artūras Mackūnas ◽

Dmitrij Šešok

Keyword(s):

Heat Conduction ◽

Numerical Experiments ◽

Heat Conduction Problem ◽

Floating Point ◽

Relative Speed ◽

Nvidia Cuda ◽

Cuda Technology ◽

Engineering Problems ◽

The Matrix ◽

Speed Up

Overview of GPU usage while solving different engineering problems, comparison between CPU and GPU computations and overview of the heat conduction problem are provided in this paper. The Jacobi iterative algorithm was implemented by using Python, TensorFlow GPU library and NVIDIA CUDA technology. Numerical experiments were conducted with 6 CPUs and 4 GPUs. The fastest used GPU completed the calculations 19 times faster than the slowest CPU. On average, GPU was from 9 to 11 times faster than CPU. Significant relative speed-up in GPU calculations starts when the matrix contains at least 4002 floating-point numbers.

Aplicação de Evolução Diferencial em GPU Para o Problema de Predição de Estrutura de Proteínas com Modelo 3D AB Off-Lattice

10.5753/wscad.2020.14080 ◽

2020 ◽

Author(s):

André Dias ◽

Mateus Boiani ◽

Rafael Parpinelli

Keyword(s):

Nvidia Cuda

A função que uma proteína exerce está diretamente relacionada com a sua estrutura tridimensional. Porém, para a maior parte das proteínas atualmente sequenciadas ainda não se conhece sua forma estrutural nativa. Este artigo propõe a utilização do algoritmo de Evolução Diferencial (DE) desenvolvido na plataforma NVIDIA CUDA aplicado ao modelo 3D AB Off-Lattice para Predição de Estrutura de Proteínas. Uma estratégia de nichos e crowding foi implementada no algoritmo DE combinada com técnicas de autoajuste de parâmetros, rotinas para reinicialização da população, dois níveis de otimização e busca local. Quatro proteínas reais foram utilizadas para experimentação e os resultados obtidos se mostram competitivos com o estado-da-arte. A utilização de paralelismo massivo através da GPU ressalta a aplicabilidade desses recursos a esta classe de problemas atingindo acelerações de 708.78x para a maior cadeia proteica.

Titan: A Parallel Asynchronous Library for Multi-Agent and Soft-Body Robotics using NVIDIA CUDA

2020 IEEE International Conference on Robotics and Automation (ICRA) ◽

10.1109/icra40945.2020.9196808 ◽

2020 ◽

Author(s):

Jacob Austin ◽

Rafael Corrales-Fatou ◽

Sofia Wyetzner ◽

Hod Lipson

Keyword(s):

Soft Body ◽

Nvidia Cuda ◽

Multi Agent

Computer vision algorithms acceleration using graphic processors NVIDIA CUDA

Cluster Computing ◽

10.1007/s10586-020-03090-6 ◽

2020 ◽

Vol 23 (4) ◽

pp. 3335-3347 ◽

Cited By ~ 1

Author(s):

Mouna Afif ◽

Yahia Said ◽

Mohamed Atri

Keyword(s):

Computer Vision ◽

Nvidia Cuda

Computing of 3D Bifurcation Diagrams With Nvidia CUDA Technology

IEEE Access ◽

10.1109/access.2020.3019633 ◽

2020 ◽

Vol 8 ◽

pp. 157773-157780

Author(s):

Artur Pala ◽

Marek Machaczek

Keyword(s):

Bifurcation Diagrams ◽

Nvidia Cuda ◽

Cuda Technology

nvidia cuda
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

OuroborosBEM: a fast multi-GPU microscopic Monte Carlo simulation for gaseous detectors and charged particle dynamics

Acceleration of boundary element calculations for closed domain using nonlinear form functions and CUDA technology

GPU-accelerated molecular dynamics: State-of-art software performance and porting from Nvidia CUDA to AMD HIP

RECENT CAPABILITY AND PERFORMANCE ENHANCEMENTS OF THE WHOLE-CORE TRANSPORT CODE nTRACER

Nvidia CUDA parallel processing of large FDTD meshes in a desktop computer

COMPARISON OF GPU AND CPU EFFICIENCY WHILE SOLVING HEAT CONDUCTION PROBLEMS

Aplicação de Evolução Diferencial em GPU Para o Problema de Predição de Estrutura de Proteínas com Modelo 3D AB Off-Lattice

Titan: A Parallel Asynchronous Library for Multi-Agent and Soft-Body Robotics using NVIDIA CUDA

Computer vision algorithms acceleration using graphic processors NVIDIA CUDA

Computing of 3D Bifurcation Diagrams With Nvidia CUDA Technology

Export Citation Format

nvidia cudaRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

OuroborosBEM: a fast multi-GPU microscopic Monte Carlo simulation for gaseous detectors and charged particle dynamics

Acceleration of boundary element calculations for closed domain using nonlinear form functions and CUDA technology

GPU-accelerated molecular dynamics: State-of-art software performance and porting from Nvidia CUDA to AMD HIP

RECENT CAPABILITY AND PERFORMANCE ENHANCEMENTS OF THE WHOLE-CORE TRANSPORT CODE nTRACER

Nvidia CUDA parallel processing of large FDTD meshes in a desktop computer

COMPARISON OF GPU AND CPU EFFICIENCY WHILE SOLVING HEAT CONDUCTION PROBLEMS

Aplicação de Evolução Diferencial em GPU Para o Problema de Predição de Estrutura de Proteínas com Modelo 3D AB Off-Lattice

Titan: A Parallel Asynchronous Library for Multi-Agent and Soft-Body Robotics using NVIDIA CUDA

Computer vision algorithms acceleration using graphic processors NVIDIA CUDA

Computing of 3D Bifurcation Diagrams With Nvidia CUDA Technology

nvidia cuda
Recently Published Documents